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(54) Customization of electronic content based on consumer attributes 



(57) A system and method provide for the interpre- 
tation and augmenting of structured documents elec- 
tronically delivered to an individual consumer's compu- 
ter using consumer profiles developed from and main- 
tained with information reflecting the consumer's online 
and offline transactions, by selecting the variable con- 
tent alternatives encoded in the structured documents 
that most closely match the consumer's profile. The con- 
sumer profiles are logically controlled by the consumer's 
computer, thus providing for enhanced security over in- 
formation that is personal and confidential to the con- 
sumer, yet still allowing third parties such as web sites 
and others electronically delivering structured docu- 
ments to the consumer to have such documents cus- 
tomized based on the consumer's profile. The consumer 
profile includes hierarchical attribute vectors which en- 
code attributes of a consumer at progressively higher 
levels of abstraction, and allowing for querying of any 
combination of abstracted data or abstracted attributes 
of a consumer. The consumer profiles are updated with 
a process that reflects the relevance of each transaction 
to the consumer's profile, and accommodates a decay 
in the influence of transactions over time. A selection 
process for selecting content allows for multiple items 
of content to be selected for sequenced display to the 
consumer, through a limited presentation space. Addi- 
tionally, financial documents are arranged for display in 
a manner which facilitates the inclusion of promotional 
information into individual transaction lines, so as to take 
advantage of the consumer's visual scanning of the 
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Description 

Cross Reference to Related Applications 

5 [0001] This application is a continuation in part of application serial no. 09/241 ,546 : entitled HIERARCHICAL MOD- 
ELS OF CONSUMER ATTRIBUTES FOR TARGETING CONTENT IN A PRIVACY-PRESERVING MANNER filed on 
February 1 , 1999, which is a continuation in part of application serial no. 09/235,610, entitled TIC: CUSTOMIZATION 
OF ELECTRONIC CONTENT BASED ON USER SIDE INTERPRETATION OF ONLINE REPORTS, filed on January 
20, 1999, which is a continuation of provisional application serial no. 60/091 ,979 filed on July 7, 1998, all of which are 

10 incorporated by reference herein. 

A. Background 

1 . Field of the Invention 

15 

[0002] The present invention relates to the creation and maintenance of models of consumers, based upon trans- 
actional data extracted from structured information received via electronic channels and viewed by the consumer and 
the use of those models to aid in presenting targeted content, such as advertising or special offers, in a way that does 
not compromise the consumer's privacy. 

20 

2. Background 

[0003] Consumers today receive a tremendous amount of information via electronic systems. Many consumers ac- 
cess the Internet or private networks to review information such as personal bank statements, credit card reports, stock 

25 portfolios, online account balances, online transaction receipts, commercial web sites, personal home pages, and the 
like. Some of this structured information reflects consumer behavior that occurs "off-line" such as credit card transac- 
tions made at retail establishments, debit, check or other bank transactions, while some of the structured information 
represents entirely online behavior (including both commercial and non-commercial behavior). Online behavior also 
includes behavior that is not directly commercial in nature, such as click-through behavior in using the Internet. This 

30 type of behavior is captured in structured documents such as URL (Uniform Resource Locator) history files stored on 
the user's computer. 

[0004] A key goal of most businesses is to target consumers with products and services that is of interest to the 
consumer based on their interests, preferences, or demographics. Thus many attempts have been made to determine 
consumer's interests and provide content (e.g. promotions, advertisements) to products and services that match such 
35 interests. 

[0005] Historically, for many types of their transactions, such as credit or banking transactions, consumers received 
printed statements from credit card companies, bank, and other financial institutions. The paper statement themselves 
typically only contain limited information identifying each transaction, its amount, date, and so forth. To enhance the 
advertising opportunities provided when the consumer is reviewing their printed statement, these institutions often add 
40 additional promotional insert documents to the statement, in the hope that the consumer will review these separate 
documents and be motivated to purchase the advertised products or services. Only limited targeting of the content of 
the advertising inserts has been possible in the past. Further, conventional credit card and bank statements do not 
specifically structure their content to take advantage of the visual scanning processing that takes place when a con- 
sumer reviews the statement. 

45 [0006] With the rise of consumer use of the Internet, advertisers have sought out ways to target promotional infor- 
mation to consumers viewing web pages and the like. On the World Wide Web, "banner" advertising is typically used 
to promote products or services to consumers viewing various web pages. Many web sites attempt to target banner 
advertising based on the content of a user's input, typically a search request, or other consumer input that is processed 
by the web site. However, targeting based on a single consumer input does not necessarily result in banner advertise- 

50 ments that actually are of interest to the consumer. This is because consumers frequently search for or desire infor- 
mation that is not necessarily related to their general interests, preferences, or demographics. Even when consumers 
are searching for information related to their interests, their online behavior may only reflect a small portion of their 
interests, and the relative importance of certain interests while online may be vastly different than the importance of 
the same interest when the consumer is off-line. Thus, online behavior may not be indicative of interests that are likely 

55 to lead to transactions being consummated online. 

[0007] Some web sites attempt to target consumers by storing databases that contain some type of identifying data 
for each consumer. When a consumer accesses a web site coupled to the database, a targeted advertisement may 
be selected based on the consumer's data. However, this approach suffers from a loss of privacy to the consumer 
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since personal information about some of their behaviors (typically only their click-throughs at the web site) are record- 
ed, as this information is stored in the database without their consent or control. At present, the owner of the web site 
is typically free to use this consumer information as they see fit. In addition, the consumer information is often based 
on very limited information about the consumer that either does not accurately reflect their true interests and preferences 

5 or does not change automatically in response to both off-line and online consumer behavior. Furthermore, this infor- 
mation is normally limited to a consumer's prior interaction with that particular web site. While : new services are arising 
which attempt to share consumer information among unrelated websites, such mechanisms may violate consumer 
privacy and potentially consumer protection laws in various jurisdictions throughout the world. 
[0008] Accordingly, it is desirable to provide a system, method, architecture and various software products that can 

10 augment structured documents that are received and reviewed online by a consumer using historical behavioral infor- 
mation about the consumer, including either online or off-line behavior, while ensuring the personal privacy of each 
consumer. 

[0009] Further, it is desirable to provide a system, method, architecture and various software products that can take 
advantage of the visual scanning process and cognitive associations of the consumer when scanning transactions in 
15 a credit card, bank statement or other structured financial document, to ensure that promotional material is viewed at 
the same time, and that the viewing is done in a manner that is not perceived by the consumer as being intrusive. 

B. Summary of the Invention 

20 [0010] The present invention overcomes the limitations of conventional models and targeting methods for delivering 
custom content to consumers that matches their interests, preferences, demographics, or psychographics. Existing 
systems make use of data warehouses that aggregate information from a variety of sources, create demographic 
segments with associated consumer preferences, decide which segment or segments a consumer occupies, and sends 
information relevant to that segment to the consumer. 

25 [0011] The present invention builds upon the analyzing or "interpreting" of structured documents delivered electron- 
ically to the consumer for information to create a consumer profile of the consumer based on a broad spectrum of both 
online and offline behaviors and transactions. In particular, a detailed model of the consumer is built using mathematical 
functions that map from the specific transactions of the consumer to estimates of the relevancy of certain attributes to 
the consumer. These models can be used to order a number of pieces of conditional content with respect to how well 

30 they match the attributes of the consumer, and hence how well they may appeal to the consumer's interests, prefer- 
ences, psychographics, or demographics. Display methods in an illumination process, which augments structured 
documents being electronically delivered to the consumer with the conditional content, allow for the consumer to view 
the most appropriate piece of content first (by its ordering), followed by the next most appropriate piece of content if 
the consumer so indicates, and so forth. A content rotator may be used by a consumer to selectively view an ordered 

35 set of conditional content. 

[0012] The consumer profile is continually updated with information extracted from electronically delivered structured 
documents and from consumer behavior such as selection of content, and thus reflects an accurate and current as- 
sessment of the consumer's interests, preferences, and demographics. The consumer profile may also be updated 
with information received about merchants, products, or consumer model parameters received from remote sources 

40 over a network. Changes in a consumer's interests, preferences, and demographics are assumed to be reflected by 
changes in their behaviors, as reflected through commercial and non-commercial transactions. 
[0013] The consumer attributes may be defined in a hierarchical model, with aggregated attributes having values 
derived from lower level attributes (either themselves aggregated, or base level attributes). In this way arbitrarily com- 
plex queries can be evaluated against the model to target very specific consumers. The hierarchical model further 

45 allows recursive selection of conditional content, with initial selection of content using higher levels of aggregated 
attributes, and subsequent selection using a combination of lower levels of attributes on which the higher levels are 
based and consumer expression of interest in each level of selected content. 

[0014] The consumer attributes are updated by the relevancy of individual transactions. Relative recency of trans- 
actions, and the probabilities of transactions occurring or not occurring may also be used to update the attributes. 
50 [0015] One embodiment of the present invention, as described herein is an integrated system of software, knowledge 
bases, and processes which enables the following functionality: 

• creation of a model consist of a vector of data values, such that each of the values represents some attribute of 
the consumer: psychographic qualities, specific interest attributes, income level, etc., using facts in a consumer 

55 database of the consumer, plus a mathematical framework for interpreting the vector of values. 

• grouping of elements of the model into a hierarchical framework that aids in a successive refinement of consumer 
attributes. 

• the use of the database and models to target and personalize otherwise generic content delivered to the consumer 
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in electronic form and including conditional content selected and ordered based on the consumer model, without 
the creator of the content having any knowledge of the consumer his transactions or the results of models. 
• presentation mechanisms for enabling the consumer to selectively view the conditional content according to its 
ordering. 

5 

[0016] In another aspect the present invention overcomes the limitations of conventional mechanisms for ensuring 
that consumers view targeted content that matches their interests, preferences, or demographics by placing the tar- 
geted content in structured documents in an arrangement that requires the consumer's eyes to encounter the content 
while performing the task of reviewing their banking, credit card, or other structured statements or documents. The 
10 structured document is organized as a set of transactions; each transaction is assigned to a transaction line, and the 
transaction line is rendered on a display device in an arrangement which positions the targeted content in the transaction 
line. 

[0017] In one embodiment, the transaction lines include the placement of the targeted content in a space that is 
visually between two important information fields of the transaction necessary for the consumer to link cognitively 

'5 together in order to understand the transaction or manipulate the transaction. An exemplary arrangement of this type 
for a credit card statement (or similar financial statement) positions the targeted content between the merchant name 
of the transaction and the transaction amount. This arrangement ensures that the consumer will see the targeted 
content as a matter of course during the routine scanning of the statement for errors. In bank statements, a similar 
arrangement positions the targeted content between a description of the transaction (e.g. check number, transaction 

20 type, etc. ) and the transaction amount. In other types of documents, the transactions may be information oriented, 
and the visual scanning process is done in order to location information related to the transaction. Here the information 
fields of the transaction may be dispersed with the promotional content displayed between two important information 
fields for this purpose. 

[001 8] In another aspect of the present invention, financial statements are displayed in a manner that facilitates the 

25 display of additional promotional content in the context of individual transactions, without excessively crowding the 
financial statement. In one embodiment of this aspect, each transaction is associated with detailed transaction infor- 
mation, such as contact information for the merchant (e.g. address, telephone number, web site) and a transaction 
receipt showing the details of the transaction (e.g. items purchased, services provided, travel itinerary, etc.). First, the 
financial statement is shown with sufficient information fields to identify the transaction, such as merchant name, date, 

30 and amount. One of these displayed information fields is hyperlinked to the detailed transaction information. When the 
user clicks on the hyperlink the detailed information is displayed in place of (or in addition to) the initially displayed 
information fields. In addition, along with the initially displayed information fields, promotional content is display in 
association with particular transactions. Because the detailed transaction information is not displayed until and unless 
requested, there is sufficient display space on the display device to display the promotional information. 

35 [0019] in another embodiment of this aspect of the invention, when a financial statement is displayed it may contain 
transactions related to many different types of categories, such as air travel, restaurants, automobile expenses, and 
the like. Along with the display of these transactions there are displayed a plurality of category selectors, each of which 
is associated with a specific category. The categories preferably are derived from the consumer's own particular spend- 
ing habits, so that each consumer views category selectors for categories which are meaningful and related to their 

40 purchasing behavior. When the consumer selects a category selector, only transactions from that category are dis- 
played. For selected ones of these now displayed categories, promotional content is displayed, preferably in association 
with individual ones of the transactions. By providing the category selectors which automatically display transactions 
in categories meaningful the consumer, the consumer is very likely to want to view these categories in order to assess 
their purchasing behavior. This then affords the second display opportunity by which the additional promotional content 

45 is displayed. 

[0020] Specifically, according to a first aspect of the present invention there is provided a method of maintaining a 
consumer profile of a consumer, the method comprising: inputting a structured document to a consumer computer; 
processing a structured document at the consumer computer to identify transactional behavior of the consumer; and 
updating the consumer profile with respect to the identified transactional behavior. 

so [0021 ] Advantageously the structured document input to the consumer computer is received from another computer. 
Alternatively, the structured document input to the consumer computer is generated locally at the consumer computer. 
[0022] Advantageously updating the consumer profile further comprises: identifying at least one form field of the 
structured document filled in by the consumer; and automatically updating the consumer profile with respect to the 
data contained in the at least one form field. 

55 [0023] Advantageously updating the consumer profile further comprises: identifying an item of the structured docu- 
ment that the consumer clicked on; and automatically updating the consumer profile with respect to the item that was 
clicked on. 

[0024] Advantageously updating the consumer profile further comprises: identifying at least one of a merchant or a 
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product in the transactional behavior; and updating the consumer profile with respect to the identified a merchant or 
product. 

[0025] Advantageously updating the consumer profile further comprises: retrieving information about merchants or 
products related to transactions of the consumer over a network from sources remote to the consumer computer. 
5 Preferably the information retrieved contains metadata related to structured documents or consumer profiles. 

[0026] Advantageously updating the consumer profile further comprises: updating the consumer profile as a function 
of a date or time of a transaction included in the transactional behavior. 

[0027] Advantageously updating the consumer profile further comprises: updating the consumer profile as a function 

of an amount of a transaction included in the transactional behavior. 
w [0028] Advantageously the structured document is an online transaction report. Preferably the transaction report is 

one of a group consisting of: an online credit card statement, an online bank statement, an online bill, an investment 

portfolio, a medical report, a receipt, a file containing a list of Uniform Resource Locators, or a tax form. 

[0029] According to a second aspect of the present invention there is provided a method of maintaining a consumer 

profile of a consumer, the method comprising: inputting an online financial statement to a consumer computer; process- 
's ing the financial statement at the consumer computer to identify transactional behavior of the consumer; and updating 

the consumer profile with respect to the identified transactional behavior. 

[0030] Advantageously the structured document input to the consumer computer is received from another computer. 
Alternatively the structured document input to the consumer computer is generated locally at the consumer computer. 
[0031] Advantageously updating the consumer profile further comprises: identifying at least one form field of the 
20 structured document filled in by the consumer; and automatically updating the consumer profile with respect to the 
data contained in the at least one form field. 

[0032] Advantageously updating the consumer profile further comprises: identifying an item of the structured docu- 
ment that the consumer clicked on; and updating the consumer profile with respect to the item that was clicked on. 
[0033] Advantageously updating the consumer profile further comprises: identifying at least one of a merchant or a 
25 product in the transactional behavior: and automatically updating the consumer profile with respect to the identified a 
merchant or product. 

[0034] Advantageously updating the consumer profile further comprises: retrieving information about merchants or 
products related to transactions of the consumer over a network from sources remote to the consumer computer 
Preferably the information retrieved contains metadata related to structured documents or consumer profiles. 
30 [0035] Advantageously updating the consumer profile further comprises: updating the consumer profile as a function 
of a date or time of a transaction included in the transactional behavior. 

[0036] Advantageously updating the consumer profile further comprises: updating the consumer profile as a function 
of an amount of a transaction included in the transactional behavior. 

[0037] According to a third aspect of the present invention there is provided a system for maintaining a consumer 
35 profile of a consumer, the system comprising: a consumer profile including a database comprising facts derived from 
online and off-line transactions of the consumer, and a consumer model describing attributes of the consumer and 
derived from the transactions, the consumer prof ile securely stored to prevent access to the consumer profile by anyone 
other than the consumer; and an interpretation module that processes structured documents associated with transac- 
tions of the consumer to derive facts related to the transactions, that selectively retrieves additional facts related to the 
40 transactions from remote sources across a network, and that updates the consumer profile with the derived and re- 
trieved facts. 

[0038] According to a fourth aspect of the present invention there is provided a method of updating a model of 
consumer attributes, comprising: retrieving a plurality of transactions; determining a measure of relevancy of each 
transaction to at least one attribute of a consumer; and updating the at least one attribute as a function of the relevancy 

45 of each of the plurality of transactions. 

[0039] Advantageously determining a measure of relevancy of each transaction comprises determining the condi- 
tional probability of each transaction occurring given a value of the attribute. Preferably updating the at least one 
attribute is performed using Bayes Theorem using the conditional probability of each transaction. 
[0040] Advantageously updating the at least one attribute further comprises: assigning each transaction to a time 

50 period in an ordered sequence of time periods; for any time period to which no transaction is assigned, establishing a 
measure of relevancy for the attribute for no transaction occurring within the time period; and updating the at least one 
attribute according to the ordered sequence of time periods, using the measures of relevancy of transactions in time 
periods in which transaction occur, and the established measure of relevancy for the attribute for time periods in which 
no transactions occur. 

55 [0041] Advantageously determining a measure of relevancy of each transaction comprises determining the condi- 
tional probability of each transaction occurring given a value of the attribute. 

[0042] Advantageously updating the at least one attribute is performed using Bayes Theorem using the conditional 
probability of each transaction. 
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[0043] Advantageously the measure of relevancy of a transaction is a function of a monetary amount associated 
with the transaction. 

[0044] Advantageously the measure of relevancy of a transaction is a function of at least one party to the transaction. 
[0045] According to a fifth aspect of the present invention there is provided a computer readable structure for modeling 
s the attributes of a consumer, comprising: a base vector comprising a plurality of base level attributes of the consumer 
each base level attribute having a value; and at least one hierarchical vector comprising at least one aggregate attribute 
associated with a plurality of base level attributes of the base level vector, and having a value computed as a function 
of the values of the associated base level attributes. 

[0046] According to a sixth aspect of the present invention there is provided a method of displaying a financial state- 
10 ment on a display device to facilitate online review of the statement, the financial statement including a plurality of 
transactions, each transaction including at least two different information fields : the method comprising: displaying the 
first information field of the transaction in a first location on the statement; displaying the second information field of 
the transaction in a second location on the statement that is viewed subsequent to the first location when viewed by 
the consumer; and displaying promotional content at a third location between the first and second locations. 
15 [0047] Advantageously the first information field is a merchant name or logo, and the second information field is an 
amount related to the transaction. 

[0048] Advantageously the first information field is a name or logo or a merchant, and the second information field 
is a hyperlink to additional information related to the merchant. 

[0049] According to a seventh aspect of the present invention there is provided a method of displaying a financial 
20 statement including a plurality of transactions, each transaction associated with detailed transaction information and 
a plurality of information fields, the method comprising: for each transaction, displaying a predetermined limited number 
of information fields in a row, the displayed information fields sufficient for identifying the transaction without determi- 
nation of all detailed transaction information of the transaction; displaying for each a plurality of the displayed transac- 
tion, promotional content in the row of the transaction, between two displayed information fields of the transaction; for 
25 each transaction, forming one of the displayed information fields as a hyperlink to the detailed transaction information; 
and responsive to a user selection of a hypertinked information field of a transaction, displaying in a separate display 
window the detailed transaction information for the transaction. 

[0050] According to an eighth aspect of the present invention there is provided a method of displaying financial 
statements to provide for targeted promotional content, the method comprising: displaying a financial statement in- 
30 eluding a plurality of transactions, the transactions associated with a plurality of different categories; displaying a plu- 
rality of category selectors, each selector associated with a specific category of transactions; responsive to receiving 
a user selection of a category selector, displaying only transactions in the selected category; and for at least one 
displayed transaction in the selected category, displaying promotional content in association with the displayed trans- 
action. 

35 

C. Brief Description of the Drawings 



40 



50 



[0051] Fig. 1 is an illustration of a system in accordance with the present invention. 

[0052] Fig. 2 is an illustration of a sample structured document. 

[0053] Figs. 3A and 3B illustrate an illuminated credit card statement. 

[0054] Fig. 4 illustrates four example illuminations of a template document. 

[0055] Figs. 5 A and 5B illustrate examples of an illuminated web pages. 

[0056] Fig. 6 is an illustration of the client and server side architecture for a World Wide Web implementation. 

[0057] Fig. 7 is a flowchart of the client-side procedures for interpreting and illuminating web-based structured doc- 
uments. 

[0058] Fig. 8 is an illustration of a system for using hierarchical consumer models for illuminating documents. 

[0059] Fig. 9 is an illustration of a hierarchical attribute vector. 

[0060] Fig. 10 is a flow diagram of the illumination sorting process. 

[0061 ] Figs. 1 1 a and 1 1 b are tables illustrating updating of the attribute vector. 

[0062] Figs. 12a and 12b are tables illustrating updating of the attribute vector with decaying of older transactions. 

[0063] Figs. 13a and 13b are tables illustrating updating of the attribute vector including aggregated attributes. 

[0064] Fig. 14 is an illustration of an illuminated credit card statement with the content rotator. 

[0065] Fig. 15 is another illustration of the illuminated credit card statement after activation of the content rotator. 

[0066] Fig. 1 6 is another illustration of the illuminated credit card statement after activation of the content rotator. 

[0067] Fig. 17 is an illustration of the effect of hierarchical discrimination of content. 

[0068] Fig. 18 is another illustration of the effect of hierarchical discrimination of content. 

[0069] Fig. 19 illustrates an illuminated credit card statement. 

[0070] Fig. 20 illustrates a transaction information detail window. 
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[0071] Fig. 21 illustrates an illuminated credit card statement that includes vertical portal navigation. 

[0072] Fig. 22 illustrates a vertical portal for dining. 

[0073] Fig. 23 illustrates an alternative layout for vertical portal for dining. 

[0074] Fig. 24 illustrates a vertical portal for travel. 

[0075] Fig. 25 illustrates an alternative layout for a vertical portal for travel. 

I. Detailed Description 



A. Terminology 

10 

[0076] The present invention may be understood by use of the following terminology: 

• A "consumer" is a person or organization that uses a commodity or service. In this context, it may be an individual 
or a collection of individuals acting as a consuming unit (e.g. a household, a company); 

15 • "Content" is information presented in any form including text, graphics, video, and/or audio; 

• A "viewer" is the representative or representatives of the consumer unit to whom the content will be presented. 

• A "transaction" is an event where some amounts of resources or information are exchanged between the consumer 
and a vendor (or content provider) at some particular time or over some particular time interval; 

• A "transaction report" is an accounting of some group of consumer transactions which includes descriptive infor- 
20 mation identifying the consumer and consumer transactions; 

• A "model" is a mathematical function which attempts to predict the appeal to the consumer of future transactions 
based on interests and behavior derived from past transactions, and consists of an attribute vector and a set of 
mapping functions; 

• An "attribute vector" is a vector of data values such that each element of the vector represents an attribute of the 
25 consumer; 

• A "mapping function" is a mathematical function that is used to convert between factual information (such as data 
regarding a transaction or whether an individual is pregnant) and elements of an attribute vector; 

• "Targeting" is the selection of what material to present to a viewer based on interests, needs, capabilities, and 
other attributes of the consumer and/or viewer; 

30 • An "illumination" is a piece of targeted content, plus query information used to determine the degree to which the 
content matches the attributes of the consumer model. 

• "Personalization" means that the details of the content, and the manner in which the content is presented to a 
viewer, are tailored to the consumer and/or viewer: and 

• A "vendor" is an entity that sells or provides a product or service to a consumer. 

35 

B. System Overview 

[0077] Referring now to Fig. 1 . there is shown one embodiment of a system in accordance with the present invention. 
The system 100 includes: 

40 [0078] A local access device 1 02 with local memory, computing capability, persistent storage, a display, and a network 
connection (e.g. a personal computer with a modem connected to an Internet Service Provider or an intelligent television 
set-top box connected to a cable head-end). 

[0079] One or more TIC enabled data servers 104 on the network which download software and provide generic 
information (i.e. information not specific to any particular consumer) that can be used to parse transactional or behav- 
es ioral information in structured documents to construct the personal database, develop the models of the consumer and 
deliver pointers (or other references) to content alternatives whose selection is based on the consumer models or 
queries to the personal database. 

[0080] One or more servers 106a which distribute online transaction reports whose format is known to the system. 
[0081 ] One or more servers 1 06b which distribute content 1 08 which includes embedded tags and queries. A report 
so server and content server may be one and the same, as illustrated in Fig. 1 . 

[0082] An embodiment of the present invention for use with the Internet is further described below with respect to 
Fig. 6. 



C. Illumination and Interpretation 

55 

[0083] The present invention includes two major independent processes: interpretation and illumination. 

[0084] Interpretation refers to the process of parsing and analyzing reports and other structured documents used by 

TIC to construct the personal database and models of the consumer. The data sources, on which the model is based, 
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include any of the consumer's transaction reports which may be accessible to his local computing device either directly 
or via the network. Such reports may include credit card statements, bank statements, electronically presented bills, 
investment portfolios, medical reports, purchase receipts, tax forms, URL history files generated by the browser, forms 
filled in by the viewer and any other parsable and interpretable data about the consumer which may become available 

s in electronic form. Since such reports provide generally accurate and very detailed data on a wide range of an individ- 
ual's activities, both online and off-line, the models developed by TIC can be extremely accurate and complete. 
[0085] Report formats, including information necessary for parsing the report data and using the data to update the 
user-model, may be self-describing, i.e. embedded as metadata in the form of annotation tags in the report itself : or 
indexed by report source in some globally accessible database managed by the TIC operator. Virtually any sufficiently 

10 structured document available to the user's device can serve as a data source for TIC. These could include databases, 
logs, and reports maintained and/or generated directly by the user himself or by software running on the user's com- 
puter. Examples include databases and reports generated by personal finances software, tax preparation software, 
loan applications, calendars, URL history files generated by web-browsers, listings of software residing on the user's 
computer, and reports of currently active processes and windows on the user's computer. 

15 [0086] Because of the nature and level of detail of the information available in the TIC model, the user of TIC will 
require a high degree of privacy. The privacy that TIC provides is that no information collected, generated or inferred 
by TIC ever leaves the user's control. The data and inferences are used entirely on the user computer for the purpose 
of customizing and personalizing content for the user. With respect to TIC, personal information about the user only 
flows from external sources into the user computer (and the locally-controlled storage device). The user may use 

20 information presented by TIC to communicate back to content providers but that is done explicitly by the user inde- 
pendently of TIC. 

[0087] Referring to Fig. 2, there is shown a pictorial example of an example structured document, here a credit card 
statement 200, and the types of information that may be usefully extracted during interpretation in order to update a 
consumer profile. For a structured document such as a credit card report, interpretation extracts information about 

25 each transaction 202, such as the dateAime 204 of the transaction, the entity 206 with whom the transaction occurred, 
the location 206 of the transaction, and the amount of resources 210 (here measured in dollars) committed by the 
consumer to the transaction. The extracted information here can be used to both update the consumer model and to 
add new facts about the consumer's transactions, preferences, or interests to the database. 
[0088] Illumination is the process of annotating or replacing sections of documents or other media with (possibly) 

30 related multimedia content. Typically the new content expands on the information in the original content and/or provides 
a more interesting presentation of the information. As examples, the name of a company or product might be illuminated 
with a graphical logo or a hyperlink to a home page, a video presentation, or a presentation for some completely 
different product or company; an input field in a form may be illuminated by replacing it with the information being 
requested; an empty spot in a video broadcast may be replaced with a commercial. The choice of illumination for a 

35 given section may depend on the content of the section, the overall content of the document, the choices made for 
illuminating other sections, the context such as time and location in which the content is being viewed and attributes 
or profile of the viewer. The illumination may occur at any point in the production and delivery process for the media. 
The degree of flexibility and context dependence in the choice of illumination will depend on how early or late in the 
process the illumination occurs and how much information about the context is available to the illuminator. This flexibility 

40 ranges from static content (i.e. content that is the same for all viewers and contexts) to viewer-specific content (i.e. 
content which may vary for each viewer). 

[0089] Figs. 3A and 3B illustrate an example illumination of a credit card statement. In Fig. 3A there is shown a credit 
card statement 300 as it would appear on a consumer's computer without the benefit of illumination according to the 
present invention. Such a statement 300 would be transmitted electronically from the computer system of a credit card 
4 5 company (or comparable financial institution) to the consumer's personal computer for review, and perhaps payment 
of outstanding balances. This statement 300 simply lists each transaction 302 as a simple line of text, with the normal 
descriptors of date, merchant name, transaction identifier, location, and amount. The opportunities for targeted adver- 
tising based on these individual transactions are wasted. 

[0090] Now considering Fig. 3B, there is shown the credit card statement 304 after illumination in accordance with 
so the present invention. First, many of the simple text-only merchant names have been replaced by graphic icons 307 
illustrating the merchant's particular tradename or mark. This reinforces brand identity right at the point the consumer 
is reviewing a particular transaction with the merchant. Second, co-branding is facilitated by the inclusion in some 
transactions of a graphic icon 308 for second merchant's goods/services. For example, in the transaction line for the 
Chevron® charge, an icon 308 for Mercedes-Benz® cars is included. The selection of which second merchant's icon 
55 308 to include in the transaction is based on the consumer's profile which is used to select one of a number of variable 
content alternatives that are encoded in the credit card statement when it is received electronically. 
[0091] Third, illumination provides for the replacement of static text, such as the merchant's name, with a hyperlink 
31 0 to the merchant's web site (or any other web site, as determined by the selectable content alternative data). Thus, 
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here illumination transforms the otherwise static credit card statement into a dynamic document that allows the con- 
sumer to immediately access more information about a merchant during the process of reviewing the credit card state- 
ment. 

[0092] Fourth, illumination provides for the insertion of promotional content 312 into specific transactions. In the 
5 illustrated example, a promotional coupon is for the product of merchant other than the merchant of the transaction. 
This facilitates co-marketing campaigns by groups of merchants. The selection of which merchant's coupons to include 
in the illuminated transaction are again based on the profile of the consumer and the attributes of each selectable 
content alternative that is encoded in the statement for this transaction. Thus, different consumers having a transaction 
with the same merchant may receive entirely different coupons 312 or co-branding icons 308 because of differences 
10 in their personal consumer profiles. 

[0093] Finally, note that the top portion of the statement 304 has been augmented by the inclusion of a graphic icon 
306 for yet another merchant, one who in this case does not have any transactions specifically listed with the consumer 
in the credit card statement. As is more fully described below, the selection of this icon 306, and other illumination 
elements (e.g. 307, 308, 310, and 312) may be usefully managed by policies associated with different sections of a 
15 structured document (e.g. the header portion, each transaction line, footer, etc.), and the structured document itself, 
where the document sections have respective owners which define the policies for illumination. The use of policies 
further extends the opportunities for controlled targeting of promotional information, since each section of a structured 
document may have policies that differently influence the selection of content alternatives relative to a consumer's 
profile. 

20 [0094] The examples of Figs. 3A and 3B illustrate illumination for a credit card statement; illumination can be applied 
to any type of structured document generally so long as it is formatted with the information describing the selectable 
content alternatives, in accordance with the present invention, including any type of financial statement generally (e. 
g. bank statements or stock portfolio statements), or transaction statement generally (e.g. commercial transactions 
with a service or product vendor). 

25 

D. Example Applications of the Invention 

1 . Substitution of Information and Multimedia Content in Document Templates 

30 [0095] In this simplest of applications of TIC, a document or other form of media have the form of templates. The 
templates contain both static content and variable content sections. The variable content sections are tagged with 
variables or expressions, which are evaluated in the context of a client database to produce a description of the actual 
content to display in that section. For example, if the client database includes the name of the viewer as the value of 
a variable viewer.name then a document could be addressed to each specific viewer by specifying a variable section 

35 with tag viewer.name. 

[0096] Fig. 4 illustrates this application. Here an electronically presented coupon 400 includes a variable section 402 
that is encoded with a variable for the consumer's name; each of the four coupons illustrated has been customized in 
this manner from a same basic template. Upon presentation at the consumer's computer, the consumer's name is 
retrieved from the client database and inserted into the variable section, resulting in a customized coupon. 
40 [0097] Similarly tags could be used to specify the logo or URL or other contact information for a specific merchant 
or organization as in merchant. logo(merchantlD= 1 23456). These tags could be evaluated at any point in the production 
process including the context of the individual viewer. 

2. Customized World Wide Web pages 

45 

[0098] Another type of structured document are web pages that are sent in the form of HTML annotated documents 
over an intranet or Internet and viewed using a World Wide Web browser, such as Netscape Corporation's Communi- 
cator or Microsoft Corporation's Internet Explorer. The HTML tags determine the form and content of the various ele- 
ments of the documents. Typically, these tagged elements include references to other documents which are loaded 

so as the tags are evaluated by the browser. 

[0099] For this application of TIC. special tags are included in the document wherever selectable content is desired. 
These selectable content tags will include information which is evaluated with respect to the individual consumer's 
profile to produce a set of options for which content to present together, with criteria for determining a measure of 
appropriateness of each option depending on the attributes of an individual viewer. Examples of such attributes might 

55 include the age and gender of the viewer, her income level, whether she owns her home, dines out often, travels to 
Asia, has a pet, or whether she is interested in science. Based on its model of the viewer, TIC will evaluate the criteria 
for each option and choose the option with the highest measure of appropriateness. The consequence is that two 
different viewers of the same URL to a TIC enabled page could be presented with very different content and have a 
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very different experience. 

[0100] Figs. 5A and 5B illustrates two examples of this application. In Fig. 5A there is shown a sample web page 
500 that has been illuminated and augmented in view of the consumer profile of a new family. Here, the web page 500 
is the home page of a merchant, and has been augmented to include information pertinent to the interests of the 
s consumer/new family. For example, in the top portion of the web page, promotional information 502 for a dining program 
has been inserted, and in light of the new family's changed consumption patterns (e.g. repeated purchases of diapers 
indicates the presence of a baby in the family), information 504 promoting infant- related services of the merchant of 
interest to the new family has been included in the third line of web page. 

[0101] Referring now to Fig. 5B, this same merchant's home page has been illuminated differently for a different 
10 consumer, here an unmarried skier. In this web page 506, the top portion now has a different promotional item 508, 
promoting a ski vacation. Similarly, since to the transactions of the unmarried skier did not imply the presence of 
children, the third line of the web page now includes different promotional information 51 0, this time for a travel auction. 
[01 02] This example shows has a templated web page can be augmented with information that is particularly based 
on the consumer profile of the consumer viewing the web page. This approach differs from existing "personalized web 
15 pages," (e.g. "My Yahoo" at Yahoo.com) since the consumer profile is based on the transactional behavior of the 
consumer from both online and off-line behavior, instead of a mere list or specification of static preferences and selec- 
tions by the consumer (e.g. selections by the consumer of which news topics, stock names, eic. to include on the 
personalized page). In the foregoing examples, the consumer did not preselect which promotional information was of 
interest to her prior to visiting that merchant's web site. Indeed, no previous contact by the consumer to the web site 
20 was even necessary. Instead, the selection of information to illuminate the web page was made dynamically based on 
the consumer profile, even the very first time the consumer visited the web site. Another difference with existing ap- 
proaches to "personalized web pages" is that the consumer profile is stored under the control of the consumer's com- 
puter. This method provides an extra level of privacy protection. 

[0103] The range of selectable content for a web page or other structured document is unlimited. This application of 
25 TIC can be used for example to implement a personalized web based magazine where articles are chosen and pre- 
sented according to the viewer's interests and preferences. This application can also be used to allow very accurate 
targeting and personalization of advertisements and other kinds of commercial offers. 

3. Automatic Completion of Fields in Electronic Forms 

30 

[0104] This application illustrates both the illumination and the interpretation functions of TIC. When presenting a 
form to a user, TIC can treat it as a template in which the input fields of the form are treated as expressions which 
evaluate to the corresponding user data. 

[0105] For example, fields like name, address, social security number, marital status, annual income, are tagged 
35 and those whose values are available in the client database are automatically filled in before the form is presented to 
the user. This part of the application utilizes the illumination component of TIC. When the form is submitted, after filling 
in unfilled-in fields or changing fields pre-filled by TIC, TIC treats the form as a document to be interpreted. Thus, 
information in tagged fields whose values were not known to TIC, or for which the values have changed from those in 
the model, is extracted by TIC and used to update and extend the personal information in the client database. 

40 

4. Targeted TV Commercials 

[0106] This application of TIC is based on the observation that the bandwidth available for TV broadcast via cable 
or satellite is greater than necessary to support standard broadcast programming. One of the options for the excess 
45 bandwidth is to broadcast multiple commercials and other kinds of announcements simultaneously together with se- 
lection criteria. A television set-top box equipped with TIC selects the most appropriate of the possible commercials 
and announcements. 

[0107] For two-way interactive TV, an alternative is for TIC to target content in a manner similar to that proposed for 
a world-wide web implementation. That is, instead of sending down multiple commercials simultaneously with selection 
so criteria, only the criteria need be sent down a much smaller data pipe. The set-top box or interactive television can 
evaluate the selection criteria against the local database and request the appropriate commercial from the head-end. 
[0108] The next section further describes one embodiment of the present invention. 

E. Basic Concepts 

55 

1 . Products, Consumers, and Viewers 

[0109] By a "product," there is meant any object or activity which may be used for any purpose and whose use can 
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be observed either directly or indirectly through some form of report, i.e. a structured document electronically delivered. 
The use of a product is called "consumption." Any individual or group of individuals acting as a consuming unit of 
products will be referred to as a "consumer." Typically, the set of individuals making up the consumer unit represented 
in the totality of transaction reports analyzed by TIC will be a superset of the set of individuals actually viewing content 

s at any particular time. The term "viewer" is used to distinguish the entity (which may itself be a collection of individuals) 
actually viewing the targeted content on a consumer computer from the consumer unit as a whole. For example, a 
newborn will not participate as a viewer of targeted content, but will be a part of the total consuming unit. So, in general, 
infant products will be relevant to the consumer unit, which includes a newborn. However, depending on lifestyle, one 
individual may have more responsibility for the newborn than others may have, and thus knowing which individual is 

10 viewing the content is important for targeting content for the newborn. In general, the appeal of a given product or 
activity will depend both on the needs and interests of the consumer unit as a whole and on the special needs and 
interests of a particular set of viewers. A query used for targeting content may reference both. 

2. Profiles: Facts and Models of Consumers and Products. 

15 

[01 10] TIC distinguishes between "facts" and "models" and allows targeting of content based on either or both. Facts 
include information obtained by direct observation, extracted from transaction reports, or supplied by generic TIC fact 
databases. Examples of facts include information about individual transactions, statistical summaries of transactions 
(totals, averages, etc.), personal information obtained from forms, and generic contact, relationship and classification 

20 information about products and merchants. A targeting query based on facts might ask, for example, about purchases 
of particular products or classes of products from particular merchants or classes of merchants. A "model" means a 
theoretical or mathematical construct developed from facts and used to make conjectures and predictions about the 
consumer's current and/or future state and behavior. A consumer prof ile comprises the consumer model and a database 
including facts pertinent to the consumer. Within TIC, the function of a model is to predict the degree of appeal a given 

25 product has to a given consumer at a some time. 

F. Components of a TIC Model 

[0111] TIC can support a variety of models, even simultaneously, within a common framework. In one embodiment, 
30 a TIC model consists of the following components: 

1 . An Array of Characteristic Values. 

[01 1 2] The basis for a TIC model is a choice of characteristic values which together form a model of a given consumer 
35 or product at a given time. The nature of these characteristics and their values is a property of the model. For example, 
one approach is to characterize an individual in terms of certain behavioral attributes, such as conservatism, social 
consciousness, status consciousness and so forth. Another approach is to characterize behavior in terms of the degree 
to which the behavior correlates with certain archetypes such as the mom, the businessman, the professional woman, 
and so on. Alternatively, the characteristics may have little or no intuitive meaning. They may simply be formal math- 
^0 ematical constructs determined by some statistical analysis of consumer behavior. TIC does not limit the choice of 
characteristics or their interpretation except to assume that the values of characteristics can be represented as a real 
number. 

[0113] The characteristic values for an object will be represented as a vector of real numbers where each value 
measures the degree to which the corresponding characteristic applies to the consumer or product. For example the 
45 value of the conservative attribute may have a value between 0 and 1 where 1 represents complete conservatism and 
0 represents complete radicalism. The vector of characteristic values for a consumer represents a model of his interests 
and the vector of characteristic values for a product represents a model of the appeal of the product. 

2. An Appeal Function 

50 

[0114] In general, the greater the correlation between a consumer profile and a product profile at a given time, the 
greater will be the expected appeal of the product to the consumer. An "appeal function" is a procedure that computes 
the appeal of a given product to a given consumer as a function of the consumer and product profiles. For example, 
an appeal function may be based on a generalized inner product of the consumer and appeal profile, e.g. a weighted 
55 sum of the results of multiplying each corresponding pair of characteristic values. 
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3. An Assignment of Appeal Profiles 

[0115] Consumer profiles are developed from an analysis of the consumer's past transactions. The contribution a 
given transaction makes to the consumer profile is assumed to depend on the appeal profile of the product consumed 
5 in the transaction, the time of the transaction, and the amount of resources committed by the consumer to the trans- 
action. In order to make use of a transaction for modeling, the product must have a pre-assigned appeal profile. Typ- 
ically, appeal profiles for merchants or products will be assigned a priori based on known statistics, market research 
or the intuitive knowledge of experts. In addition, with the explicit permission of randomly selected consumers, con- 
sumer profiles can be fed back to the model in order to refine appeal profiles. 

10 

4. Consumer Profile Update Process 

[0116] The maintenance of the consumer profile is an on-going iterative process. Prior to having any knowledge of 
transactional data about the consumer, the consumer is assigned an initial profile based on known demographic infor- 
ms mation. The consumer profile update process analyzes each newly interpreted transaction, in particular, product or 
merchant profile, time of transaction and transaction amount to produce a refined profile. Typically, this process will 
depend on more than the current profile and the new transaction. It will probably also require the maintenance of 
aggregates and other summary data about the profile and transaction histories over various time intervals. As an 
example, it may require maintaining totals of resource consumption for each characteristic over various intervals, rc- 
20 membering specific transactions, such as purchases exceeding some threshold amount, and changes in profile over 
various intervals. 

G. Overview of TIC Components 

25 [0117] The TIC system is divided into clients and servers. A client system is one used by a viewer. A TIC server 
system is one external to client systems and which provides TIC software and generic information used by the client 
system for interpretation and selection. Except for providing information requested by the client, all the important TIC 
processing (i.e. interpretation and selection) takes place on the client system. In addition to TIC servers, the TIC en- 
vironment includes third party servers which provide report data for interpretation, illuminated content, and/or content 

30 requiring illumination. Either TIC or third-parties may also provide physical disk storage for encrypted data which is 
logically under the control of the client system. 

1 . Client Side Components 

35 [01 1 8] The client side of TIC consists of a database which represents the TIC model of the consumer together with 
methods which populate the database (from document interpretation) and use the model for targeting and personali- 
zation of content. 

a) Consumer Information 

40 

[0119] All personal information regarding a consumer is represented and stored as objects in the TIC database under 
the control of the consumer's viewing device. Selection criteria are implemented as queries against this database and 
may reference facts ("Are there any known transactions with Macys?"), models ("Is it likely that someone in the family 
is pregnant?"), or both. 

^5 [0120] The database of information about consumers maintained by TIC includes the following components. 

b) Facts database: 

[0121] The facts database includes,, personal information (e.g. name, address, social security number) about con- 
so sumers, transactions histories (e.g. credit card purchases from Macys) and summaries (e.g. totals, averages) of trans- 
action data together with facts about elements in the domains of the transactions (e.g. URL and phone number for a 
given merchant, the fact that two merchants are competitors, product to product category mappings, the fact that two 
products are complementary). 

55 c) Model(s) data 

[0122] This is data relating to the current state of each active TIC model. This includes the current characteristic 
values, summary data, deltas (i.e. changes), and whatever additional values are required by the model to continue the 
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update process and for model-based selection of content. 

d) Metadata 

5 [0123] Metadata describes data types, data sources, summary entries, relevance of facts to conjectures, current 
conjectures, integrity constraints, update frequencies, report formats and the mapping from report data to database 
schemas. The metadata also includes the history of the viewer's interaction with TIC and policies used by the content 
selection engine. The system is reflective in the sense that metadata can be queried and reasoned about in the same 
context as "ordinary" data. 

10 

e) Calendar 

[0124] The Calendar is used by the scheduler to manage model updates and maintenance. All components of the 
consumer database and models are dynamic. New reports result in additional database entries and over time facts 
15 may be forgotten (aged out of the database) leaving only their contributions to summary data. Conjectures change as 
confirming or refuting evidence is accumulated. New and changed inference rules and metadata are downloaded to 
the client periodically. 

f) Report Interpreter 

20 

[0125] The report interpreter parses online documents and interprets the entries as new facts. The report format and 
the mapping from report data to the schemas in the facts database may be self-documenting (i.e. specified within the 
report itself) or registered by source in a server database and downloaded and stored as metadata. As new facts are 
entered, triggers associated with consistency constraints arc activated to download additional data (e.g. the entering 
25 of a transaction referencing an unknown merchant cause information about the merchant to be downloaded from server 
databases.) Similarly when facts key to a conjecture are downloaded, the Modeling Engine is invoked to update con- 
fidence measures and initiate new conjectures. 

g) Modeling Engine 

30 

[0126] The Modeling Engine updates consumer models in response to new facts or when queried by the selection 
engine. 

h) Selection Engine 

35 

[01 27] The selection engine chooses what content to present to the viewer when presented with a selectable content 
option. The choice is controlled by appropriateness, as specified by policies of the media owner and content provider, 
relevance which is evaluated from models of the viewer based on relevance criteria specified by the content provider, 
specific constraints for a specific slot specified by the media owner, and overall business policies specified by the media 
40 owner and TIC operator. The relevance criterion is represented as a logical expression which is evaluated relative to 
the viewer model to produce a confidence measure via the same inferencing used to evaluate other conjectures. In 
representation and use, the criterion in effect defines the conjecture: "this content is relevant to the current viewer at 
the current time." 

45 i) Scheduler 

[0128] In the preferred configuration, TIC executes as a background service on the client computer. The scheduler 
is responsible for scheduling system maintenance including downloading reports to which the user has subscribed and 
regular updates of facts, rules, and policies. 

50 

2. TIC Server Side Components 

a) Server side knowledge bases 

55 [0129] The primary purpose of the server side is to serve as a repository for information required by the client systems. 
These include: 

• Report metadata: Information required to parse and interpret documents. 
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• Domain specific fact databases: Information relating to report elements. In interpreting credit card reports, for 
example, these include merchant and product IDs, contact and classification information for merchants, products, 
and manufacturers, and relationships between merchants, products and manufacturers. 

• URL and Software classification databases: These are particularly important for analyzing viewer behavior based 
5 on active web pages, windows, and processes. 

[0130] Rules and policies: The rules and policies used by the Modeling and Selection Engines for updating the user 
model and measuring and selecting content. 

to • Content Description Databases: Elements of these databases contain references to actual content together with 
relevance criteria and the metadata required to evaluate appropriateness and consistency with business policies. 

• Statistical, Accounting, and Administrative databases: These databases track the number of views and other sta- 
tistics needed for billing and determining overall effectiveness of TIC content selection. 

*5 H. Information Used for Selection 

1 . Aging/Evolution of Profiles and Data: Trend Prediction 

[0131] A consumer's profile changes overtime. Generally these changes are evolutionary and. in some cases, even 
20 predictable. Attributes will, in general, change at differing rates and the rate of change will, in general, itself depend 
on the current profile. Thus, for example, profiles for young people will change more rapidly than profiles of older people. 
Some changes are predictable. A pregnancy either terminates or results in a newborn; the newborn eventually becomes 
an infant; the infant becomes a toddler; and so on. In some cases, patterns of change are predictable but the precise 
timing is not (e.g.. at some point almost every adolescent becomes extremely concerned about his or her appearance 
25 though the individual age at which this occurs may vary). In these cases, certain specific indicators, e.g. purchase of 
acne medicine, may be used to discover that a transition has occurred. In general, specific kinds of changes can be 
recognized by changes in consumer profile, for example a purchase from a maternity store. One way of capturing time 
dependence which is used by TIC is to compute and maintain profiles over varying periods and track the deltas for 
these profiles, i.e. the changes in the profiles. Selection criteria and computed attributes can thus be based on behavior 
30 over specified periods or on changes in behavior. 

2. Selection Data 

[0132] The information used by TIC for selection has four basic components: 

35 

1 . Personal information and online behavior; 

2. Transaction histories and summaries; 

3. Conjectures about the psychographic and demographic attributes of the consumer and predictions of important 
events which will cause discontinuities and possibly fundamental changes in the consumer profile. A birth, a mar- 
riage, a divorce, the purchase of a house, and similar transitions are important in that they result in fundamental 
changes to the profile and very different behavior during the transition period. 

4. Conjectures about the psychographic and demographic attributes of the current viewer. 

[01 33] The f irst two components contain purely empirical data which is obtained either by direct observation of online 
45 actions or by interpretation of reliable reports. The third component is theoretical; it represents a guess about the 
consumer's characteristic behavior and lifestyle and potential changes based on information in the first two components. 
The fourth is also conjectural based primarily on online behavior and is used to distinguish the individual viewer from 
the consumer unit as a whole. 

50 3. Personal Information and Online Behavior 

[0134] Personal information, such as name and social security number, is extracted and accumulated from online 
forms filled out by the user or from the interpretation of online statements. The primary use of this personal data is in 
pre-filling online forms and as a source of demographic information, e.g. age, marital state, number and ages of children. 
55 [0135] Online behavior includes information like what software applications reside on the user's machine, what ap- 
plications are currently active, favorite websites, recency and frequency of visits, and TIC interactions including what 
content was shown when and what content elicited an online response from the user. Sources of online behavior 
include interpretation of log and history files maintained by browsers and direct recording of behavior within TIC enabled 
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documents. 

4. Transaction Histories and Summaries 

5 [0136] Transaction histories are obtained by interpreting online reports. A "report instance" means a document from 
a specific vendor describing a specific set of transactions, for example, a credit card report for a specific account 
specifying transactions that occurred over a specific period of time. A "report" means the format of report instances 
from a given vendor, e.g. a credit card report from Citibank. All report instances of a given report preferably have a 
common format, i.e. can be parsed by the same parser, (e.g. If a credit card company has multiple credit card report 

10 formats depending on the class of credit card then each is considered a distinct report.) A "report type" means a class 
of reports having a common interpretation, e.g. a credit card report. 

[0137] With each report type, TIC associates a database schema and summary data. Summary data includes sta- 
tistics on recency, frequency, and total amounts aggregated over individual products and vendors and over classes of 
products and vendors. All report instances for a given report type are interpreted into a common schema and summa- 
15 rized. Thus, all credit card reports will be interpreted into the same database schema regardless of source. The schema 
for a credit card for example would include fields for account, date of sale, amount, merchant identifier, reference 
number and location. 

5. Consumer and Viewer Profiles 

20 

[0138] The consumer and viewer models are represented as weight vectors over component attributes while the 
consumer profile depends on all observations and transaction reports. The viewer profile, for the current viewer, is 
computed only from observations and reports which are known to apply to the current viewer. The extent of viewer 
specific profile information available depends on the client environment. At a minimum, it includes actions within the 
25 current session of TIC. If a URL history report is available, it is interpreted by TIC because it is useful for differentiating 
between the specific interests of the current viewer and that viewer's interests as a member of the consumer unit. 

1. How Content is Targeted and Selected 
30 1 . Selection Criteria 

[01 39] Each element of a content description database contains a content specifier and a selection rule. The purpose 
of the selection rule is to determine whether the content specified will be of interest to a given consumer. A selection 
rule is represented as a query expression which is evaluated against the TIC consumer database. There are two basic 
35 kinds of selection criteria which may be combined using logical connectives (not, and, or) to form a given selection rule. 

2. Model Based Tests 

[0140] An appeal vector together with a threshold parameter defines a model based test. Model based tests may be 
40 specified for the consumer unit as a whole or against the individual viewer. The appeal vector is tested against the 
specified user model. The threshold parameter is used to vary the sensitivity of the test. 

3. Direct Database Queries 

45 [0141] These are arbitrary database queries against the transaction histories and summaries. For example, "is this 
consumer already a customer of mine?", "is the consumer a customer of competitor X", "has the customer spent at 
least a thousand dollars on consumer electronics products over the past year". 

4. Selection Policies 

50 

[0142] Selection criteria are associated with and used to determine the suitability of a specific piece of content to a 
given consumer or viewer. Selection Policies, on the other hand, are associated with potential locations for content. 
Policies are typically defined by the owners of these locations and are used to restrict the set of candidates considered 
for that location and to choose which candidate to use in case multiple candidates are suitable. For example, content 
55 appearing on an entry for "Circuit City" on a Bank of America credit card statement may be subject to policies defined 
by Bank of America which owns the page, Circuit City which owns the entry, and general policies of the TIC operator. 
Selection policies are essentially analogous to selection criteria in the sense that where selection criteria test suitability 
of a consumer, a selection policy tests suitability of content. 
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J. An Architecture for an Implementation of TIC on the World Wide Web 



1 . Overview 

s [0143] As noted earlier TIC is particularly well suited as an application for the World Wide Web. The viewer views 
content via an Internet browser such as the Communicator from Netscape or the Internet Explorer from Microsoft. In 
this context, report data is accessed via HTTP and represented as XML annotated text. In reports, the XML tags are 
used to specify the report type, report source, and report format. 

[0144] Web Pages with TIC selectable content options are represented as XML annotated HTML text. Special XML 
10 tags mark selectable content with selection constraints and display criteria. TIC mediates the display of these pages. 
TIC can be con Fig. d as a plugin, like Shockwave, or as a Java applet. TIC enabled web pages are pre-processed by 
TIC before being passed to the browser for presentation. XML tags specifying the format and interpretation of inter- 
pretable report data are evaluated by TIC and the report data is interpreted. Selectable content tagged data is evaluated 
to produce an HTML reference via a TIC URL to actual content and that reference is substituted for the selectable 
15 content specification. TIC specific tags are stripped and the resulting page passed to the browser for display. The TIC 
URL substituted for selectable content is reported to the TIC accounting server. 

2. Client Side Components 

20 [0145] Referring to Fig. 6, there is shown one preferred embodiment of the client side 600 components of TIC as- 
suming Windows NT as the client operating system and Netscape 4.0 as the client browser X. and the server side 
components. In this embodiment, TIC is written in Java. A similar implementation can be constructed for other client 
operating systems and network browsers. 

25 a) Secure Client Database 

[0146] The heart of the client side 600 of TIC is a secure database J. All consumer-specific persistent data of any 
kind including transaction data, metadata, model data, calendar data, policies, administrative information, the location 
of TIC servers 604, reside in and are only accessible through the database interface. Access to the database is through 

30 Java. Only TIC software has access to the database access routines and the on-disk forms of the data are encrypted 
with a private key under the control of the client (i.e. consumer computer). Fig. 6 illustrates the client database J as 
locally resident on the client/consumer computer, which is one preferred embodiment. However, in other embodiment, 
the client database be physically resident on other devices, and only logically controlled by the consumer computer, 
so that access to the database still remains fully under the control of the consumer. 

35 [0147] In addition to maintaining client-specific data, the client database J behaves as a cache to the TIC server 
databases 604. Thus ail queries, even for generic information such as data about merchants or products, are evaluated 
first in the client database J. If no information is available locally and there is no flag indicating that global information 
is unavailable, a remote query for the data is evaluated at a TIC server 604. If no information is returned, a flag indicating 
that information is unavailable is stored along with a timestamp. If the requested data is available, it is returned to the 

40 client and stored in the client database J. 

d) System Maintenance Service 

[0148] In the preferred implementation, the downloading and interpretation of transaction reports and communication 
45 with TIC servers for software maintenance and other generic data requests are handled by a system maintenance 
service L running as a background process. Activities of this service L are driven by events scheduled on the TIC 
calendar, which is a list of octivity, time> pairs in the client database J, which are used by the scheduler M to initiate 
the activities at the proper time. Version information and the protocols and information necessary to access consumer 
transaction reports, along with the event calendar are stored in the client database J. Before carrying out a scheduled 
50 activity, the service L queries the client database J to determine whether the system libraries have been updated. If 
so, the relevant libraries are reloaded. 

a) Scheduler 

55 [0149] The scheduler M is the software service responsible for triggering maintenance service and report interpre- 
tation for subscription reports. 
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b) Subscription Service 

[0150] Requests for regular maintenance updates and report downloading are handled through the subscription 
service K which in turn updates the system events calendar. 

5 [0151] TIC Interface and HTML/XML Parser: Access to TIC from the browser X is through an interface H provided 
by JavaScript and Java Applets. Since TIC requires access to local persistent storage and network access to commu- 
nicate with TIC databases J, the Java applets run in trusted mode. All sites making use of TIC are certificated. 
[0152] As described in the overview section, fields for interpretation and illumination are tagged using XML (Extended 
Markup Language) which is parsed by an XML parser. The first step in the interpretation or illumination of a web 

10 document is to parse it into list structures. This is the role of the parser. After this is done, all manipulation of the page 
is carried out on the list structures. 

e) Report Interpreter 

15 [0153] The report interpreter I operates on parsed reports. It first pulls out the XML tagged components that identify 
the report. The interpreter I then queries the client database J for the type and format of the report. The format is 
represented as an object which includes a method for parsing the list structured form of the report into a stream of 
logical records. The report type includes a method which interprets each logical record. This method is applied to the 
stream of records. 

20 

f) Page Illuminator 

[0154] The page illuminator N operates on a parsed page. It generates a list of the illumination forms, i.e. the forms 
with XML tags specifying TIC illumination. It then passes that list to the selection engine, which is inside Page Illuminator 
25 N, to choose the actual content (possibly the empty content) to substitute for each illumination form. The resulting 
structure is then converted back to text by an HTML Printer and streamed, using JavaScript, to a browser window for 
display. 

4. Server Side Data Servers 

30 

[0155] The server side components of TIC are essentially passive data providers and are managed by the TIC op- 
erator or third-parties. 

a) TIC Data Servers 

35 

[01 56] The primary purpose of the TIC data servers 604 are to serve as a repository of software and domain specific 
data related to the subject matter of reports, such as product descriptions and vendor contact information. These are 
represented as standard relational databases with standard web-based interfaces used to populate, query, and modify 
them. Specifically, there are four types of servers managed by the TIC operator. 

40 

1 ) Accounting Server 

[01 57] The function of the accounting server A is to track the use of illuminations for billing purposes. The accounting 
server A is updated as conditional illuminations are selected by the targeted URL server D for presentation to the client 
45 600 and again when the chosen illumination is selected by the client 600. 

2) Domain Knowledge Server 

[0158] The domain knowledge server B contains domain specific data related to report elements. In a credit card 
so context, for example, this data provides information about merchants including address, phone numbers, URLs. 

3) Metadata Server 

[0159] The metadata server C provides software, policies, report format, and other forms of non-domain specific 
55 data required by TIC to interpret and illuminate documents. 
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4) Targeted URL Server 

[0160] The targeted URL server D presents selectable content options to the page illuminator N. These options 
consist of queries with associated content tokens. The queries are evaluated against the client database J to determine 
5 the most appropriate content tokens. When the page illuminator N has selected among the options, the content tokens 
are converted to URLs and the accounting server A is updated. 

5. Third Party Servers 

10 [0161] A third party server 602 can be any website which provides documents or data used by TIC for interpretation 
and illumination. Conceptually, there are three kinds of servers. However, a given website may act as one or ail of 
these types. In fact, a given document may be both interpretable (i.e. have a report format which is parsed into data 
fields for updating the secure client database) and illuminable (i.e. have embedded TIC-tags which are replaced during 
the illumination process). 

15 

1) Report Server 

[0162] A report server E is a source of the interpretable reports used by TIC to populate the secure client database 
J used in the illumination process. 

20 

2) Illuminable Document Server 

[0163] An illuminable document server F provides illuminable documents, which are web content enabled for TIC 
illumination. These documents include a Java preamble which invokes the TIC client side processing and TIC tags for 
25 'illuminable elements. 

3) Illuminations Server 

[0164] Since illuminations are implemented as URLs, any website can be a server G for illuminations for illuminated 
30 pages. The page illuminator N simply substitutes the appropriate HTML text. The Internet browser X then embeds the 
content automatically in the displayed document. 

K. Exemplary Interpretation and Illumination Processes 



[0165] Referring again to Fig. 6, there is shown various illumination and illumination processes using the above 
described architecture. In these processes, data flow occurs over either the Secure Sockets Layer or standard HTTP. 



Data/Process Flow 


Action 


1 . Scenario 1 : Client views TIC enabled report 


1 

2 

3 
6 


The report server E sends a TIC tagged report in response to a client request. 

The TIC client service 600 interprets the report and sends a request to TIC servers 604 for 

domain knowledge, metadata and a list of content selection queries. TIC client service 600 

also updates TIC secure client database J with relevant information. 

TIC client service 600 evaluates the list of queries against the consumer model in the client 

database J to select the most relevant selection and requests the targeted URL server D to 

send the URL associated with the selection. The targeted U RL server D also logs the selection 

with the accounting server A for accounting and billing purposes. 

TIC client service 600 gives control back to browser X which displays the page by resolving 
the URLs placed by the TIC client service 600. The content for these URLs are resolved from 
illumination servers G. 




2. Scenario 2: Client views TIC enabled Home Page 


4 
2 


An illuminable document server F sends a TIC tagged home page in response to a client 
request. 

TIC client service 600 interprets the page and requests TIC servers 604 for metadata and a 
list of content selection queries. 
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(continued) 





Data/Process Flow 


Action 


5 * 




2. Scenario 2: Client views TIC enabled Home Page 


10 


3 
6 


TIC client service 600 evaluates the list of queries against the consumer model to select the 
most relevant selection and requests the targeted URL server D to send the URL associated 
with the selection. The targeted URL server D also logs the selection with the accounting 
server 

A for accounting and billing purposes. TIC client service 600 gives control back to browser X 

WillUIt Ulopleayo 

the page by resolving the URLs placed by the TIC client service 600. The content for these 
URLs are resolved from illumination servers G. 


15 




3. Scenario 3: Client views TIC enabled Forms 


4 


An illuminable document server F sends a TIC tagged form requesting fields to be pre-filled 
from the secure client database J. Upon client 600 sending form back to server F, TIC client 
services 600 interprets the form and updates the secure client database J with field additions 
and changes. 


20 




4. Scenario 4: TIC secure client service requests scheduled report 


25 


5 
2 


Report server E sends a TIC tagged report in response to a scheduled request from the 
scheduler M of the client 600. 

TIC client service 600 interprets the report and requests TIC servers 604 for domain 
knowledge and metadata. TIC client service 600 updates TIC secure client database J with 
relevant information. 



L. Client-side Procedures 



30 [0166] Within the browser, TIC is activated by JavaScript preambles in pages loaded from TIC enabled web sites. 
Fig. 7 illustrates the client side processes that occur when illuminating and interpreting TIC enabled documents. 

1 . Installation 

35 [0167] A TIC enabled web page initially arrives 702 at a client computer. Browser pages from TIC enabled sites 
contain a JavaScript preamble which tests 704 the client environment to determine whether TIC components have 
been installed on the client and is up to date. If TIC has not been installed, the client is offered the option of downloading 
and installing TIC. The installation procedure 706 installs Java libraries, initializes the TIC database, and creates, 
enables, and starts the TIC background service process. If TIC is installed but not up to date, new libraries are installed, 

40 an update applet run, and the version information in the database updated. 

[0168] Models are represented as subclasses of the abstract class Model. In addition to having the attributes and 
methods described earlier, objects of class Model have an installation method which creates the database schema for 
the model including storage for characteristic values and summary data with associated triggers. The installation meth- 
od also registers the Model in the database so that it can be used. Installation of a model involves downloading the 

45 class definition for the model and running the install method for the model. 

2. Invoking TIC from the Browser to Process a Page 

[0169] After any necessary TIC installation has been completed, the JavaScript preamble in a TIC enabled page 
so determines 71 0 whether TIC processing has been disabled on the client. (Disabling of TIC would typically be done at 
the request of the viewer.) If so, then the page is passed 708 through to the browser without any TIC processing. 
Otherwise, the page is passed to a top-level routine which parses the page and determines 712 whether there is any 
interpretable or illuminable content. If there is interpretable content, it determines 714 whether the report has already 
been interpreted. If not, then depending on policy 716 it may schedule it to be interpreted later by the interpreter process 
55 or have it interpreted 718 immediately. If it has illuminable content 726, the illuminator is invoked 728; otherwise the 
page is just passed 730 through to the browser for display. 

[0170] For regular reports with interpretable data, at the discretion of the vendor, a JavaScript may be run which 
offers the viewer the option of subscribing to the report, i.e., arranging to have the TIC service process automatically 
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download and interpret the report at specified intervals and times. 

3. Interpreting a Report Record 

5 [0171] The task of interpreting a report record is handled by a method of the report type. Every report type has an 
associated schema which includes a table or tables for the transaction history with triggers for updating summary data 
and preserving referential integrity. The interpretation method first inserts the record data into the table or tables which 
automatically causes summary data to be updated and additional data to be downloaded as necessary to preserve 
references. It then cycles through the list of registered models applying the update methods to the transaction. 

10 

4. Selecting Content 

[0172] The role of the selection engine is to determine which illuminable elements on a page to illuminate and for 
those choices what content to use in the illumination. In general, policies will constrain the number and types of illu- 
minations on a page. These policies are defined by parameters which constrain the density and total number of illu- 
minations and by rules in the form of queries to be executed against content databases (see below) which constrain 
the candidate sets for a given page and a given illuminable element within the page. Policies may be associated with 
the type of the page, the owner of the page, and the owner of a given illuminable element within the page. In addition, 
a specific page may contain explicit policy constraints expressed using XML. In addition to constraints, policies may 
also specify strategies and heuristics to use in finding and choosing a satisfying set of illuminations and choosing a 
particular illumination for a given element. For given a illuminable element, factors influencing the choice might include 
the appeal of the content based on one or several active consumer models, frequency constraints or requirements for 
a given choice of content, vendor preferences of TIC, the owner of the page, and the owner of a given element. Like 
constraints on content, the policies regarding choice strategies can be specified implicitly by type and owner and ex- 
pressed explicitly within the page using XML. 

M. Hierarchical Models 

[0173] Referring now to Fig. 8, there is shown one embodiment of a system in accordance with the present invention. 
30 The system 800 includes: 

[0174] A computing device 802 with local memory, computing capability, persistent storage, a display, and a network 
connection (e.g. a personal computer with a modem connected to an Internet Service Provider or an intelligent television 
set-top box connected to a cable head-end). The computing device 802 has access to a database 804 (logically part 
of the computing device, but it could be resident outside of the device, such as on a remote server computer maintained 
35 at a web site), which contains facts extracted from the process of interpretation 806 of a structured document. These 
facts are used to build an attribute vector 808 through the use of a mapping subsystem 810. Each new fact entered 
into the database 804 is presented to the mapping subsystem 810, which uses knowledge obtained from a metadata 
server (Fig. 7) to update numeric fields of the attribute vector. The metadata is cached in the local database 804 and 
so is available to mapping subsystem 810. 
4£> [0175] Elements of the attribute vector 81 0 express such consumer attributes, interests, preferences, or demograph- 
ics. The data elements are preferably scaled values, though actual values, Boolean, or qualitative values may be used. 
For example, scaled values may be used for attributes are both numerical, such as relative income (e.g. on a scale of 
0 to 1 , where 0 is very poor and 1 is very rich), or qualitative, such as "interest in tennis" (e.g. 0 is none, 1 is very tennis- 
centric). The selection of which attributes to encode in the attribute vector is within the discretion of the system designer 
45 and may reflect any useful categorization or attributes of consumer interests, preferences, demographics, or the like. 
[0176] The elements of the attribute vector 810 can be accessed directly, or can be aggregated and abstracted in a 
variety of ways, which will be described in detail shortly. A useful abstraction is the Boolean Abstractor 812, which 
examines elements of the attribute vector 81 0 and creates logical expressions from them. For example, if the attribute 
vector for consumer C contains an element Xj which represents degree of confidence that consumer C is pregnant, 
50 then the Boolean Abstractor may contain a rule that maps the query pregnant? (C) (which returns TRU E or FALSE) to 
the threshold inequality: x f > .75, which can be checked in the model's attribute vector 808 for a specific consumer. 
[0177] The goal of creating the attribute vector is to allow illumination candidates to be evaluated and ranked ac- 
cording to their relevance to the consumer. An illumination consists of content, plus a query that may be Boolean or 
may be a function of elements of the attribute vector. A set of unsorted illuminations 814 is presented to the system 
55 802, and forms a group of selectable content alternatives that may be used to illuminate portions of the document. The 
unsorted illuminations 814 are received in association with a structured document or portions thereof, electronically 
delivered to the consumer's computer 802. The unsorted illuminations 814 may be received from one or more content 
providers, even for a single document. For example, if the structured document is a credit card statement, the merchant 
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identified in each statement line may be responsible for transmitting a number of illuminations to be applied only to its 
one line. Each content provider will generally have only a limited display area in which to present the illuminations 
when the document is displayed on a display device. 

[0178] The Illumination Sorter 81 6 then uses data from three data sources {the database 804 directly, the attribute 
s vector 808, and the Boolean Abstractor 812) to do two things. First, it selects a set of illuminations that match either 
facts about the consumer in the database 804 or the consumer's attribute vector 808 well enough. Second, the Illumi- 
nation Sorter 816 sorts the selected illuminations in an order determined via a match score computed from the three 
data sources. The sorted illuminations 8 1 8 are then presented to an Illumination Display subsystem 820 for presentation 
to the consumer. 

10 

N. The Attribute Vector 

[0179] The present invention includes a novel method for storing and using information about individual consumers 
as derived from the transactional behavior of the consumer. The act of interpretation results in a number of facts stored 
15 jn a database. These facts can be very specific, for example, that a consumer spent $1 36.78 at Hertz Car Rental in 
Sacramento on January 4, 1 999. In some cases, an advertiser (such as a competing car company) might want to target 
individuals who rented from Hertz. Such a query is easy to do in a standard database, for example, one that uses SQL 
commands. 

[0180] However, in other cases, an advertiser may wish to target individuals who rent cars relatively frequently. In 
20 this case, it would be beneficial to have an element of a model's attribute vector represent a propensity to rent cars. 
Such an element should have the ability to aggregate car rental data across a number of different transactions. It should 
also be possible to allow the strength of the attribute to "decay" over time, so that if a consumer does not rent a car 
for some period of time, this attribute becomes less pronounced. 

25 I. Hierarchical Models Using Attribute Vectors 

[0181] Fig. 9 depicts an attribute vector 808, comprising a set of hierarchical vectors 902, 904, and 906. Abase level 
vector 902 shows the vector quantity x comprising a number of base level attributes, having scalar values x n> x 2 , x 3 , 
up to x n . Each scalar value can represent a different consumer attribute. For example, x, may denote an interest in 

30 football, x 2 an interest in baseball, x 3 an interest in volleyball, etc. Obviously, depending on the level of granularity in 
the description, there may be a large number of scalars. These may be grouped into, abstract collections, each rep- 
resented by an aggregated attribute value. For example, consider hierarchical vector 904, a vector quantity a comprising 
scalars a 1t a 2( -a 3 , ...a m . Each aggregated attribute in this vector 904 is associated with a selected plurality of base 
level attributes. Here, the scalar a 1 is an aggregated attribute that may represent an interest in sports. This value is 

35 an abstraction of data in vector 902. In one embodiment, a., is an appropriately weighted and normalized sum of the 
elements through x 6 of vector 902; similarly, a 2 is a weighted and normalized sum of x 7 through x 9 , and represents 
an abstraction of the attributes that these scalars x 7 .. .x g represent. Element a 3 is the same as element x 10 of vector 902. 
[0182] This abstraction process may be repeated as needed at further levels of abstraction, wherein one hierarchical 
vector is used as the base level vector for another hierarchical vector. For example vector 906 has an aggregated 

40 attribute b t which is associated with a selected set of vector 904's aggregate attributes a n and a 2 , and has value which 
derived from the values of the aggregate attributes. In this way, hierarchical representations of consumer interests may 
be built. A general "sports interest" aggregated attribute in one vector may be derived from lower level attributes for 
football, baseball, basketball, tennis, volleyball, etc. A genera! "culinary interest" may likewise derived from "cooking" 
and "restaurant going" attributes, the latter being further derived from specific restaurant type attribute, such as Chinese, 

45 Thai, Italian, etc. 

[0183] Boolean abstractions of the data are also possible. For example, one example of a function in the Boolean 
Abstractor 812 computes the threshold function [(x^ > .4) & (x 7 > .7)] where x 2 and x 7 are taken from the appropriate 
vector and elements (basic or aggregated) of an attribute vector 808. If x 2 indicates an interest in football and x 7 an 
interest in San Francisco, then the threshold function might represent the Boolean query °49'ers-fan?(C)" for consumer 
50 c, thereby allowing a content provider to craft a Boolean query for San Francisco 49'er-related targeting of their con- 
ditional content. Similarly, threshold tests can be created forfrequent-car- renter? and pregnant?, and so forth. These 
functions are merely exemplary; the present invention allows any type of Boolean function on the elements of an 
attribute vector to be implemented and passed dynamically to the Boolean Abstractor 812 for execution. 

55 2. Illumination Selection Process 

[0184] Fig. 10 depicts the process flow for matching an illumination against an attribute vector and data in the data- 
base 804. As previously mentioned, the database 804 contains facts derived from the consumer's transactions, plus 
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relevant metadata cached after retrieval from the metadata server. The mapping subsystem 81 0 has updated the fields 
of the attribute vector 808, and the Boolean Abstractor 812 provides additional Boolean functions that may be used in 
matching vectors associated with received conditional content against the consumer's attribute vector 808. 
[0185] The Illumination Sorter 81 6 selects and sorts a set of illuminations by measuring each against the data sources 
5 804, 808, and 812, using the matching subsystem 1018 to compute a match score 1020 for each illumination. Illumi- 
nations whose match score is above a threshold associated with each illumination are selected; the selected illumina- 
tions are ordered by their match score to form the sorted illumination list. 

[0186] It should be noted that all of the foregoing testing of illuminations and generation of match scores occurs 
entirely under the control of the consumer's computer, and thus without the providers of the illuminations having any 

10 access to the highly sensitive and private information about the consumer that is contained in the database 804. 

[0187] In order to describe the illumination matching process in more detail, it is useful to describe the components 
of an illumination. An illumination 1030 comprises three main components: The content 1032, which will be used if the 
illumination is selected for display; the Boolean query 1034, which provides for matching against either facts in the 
database 804 or abstracted data from Boolean abstractor 81 2; and, model data 1 036, further comprising a target vector 

'5 1 038 and a relevancy vector 1 040. 

[0188] The content 1030 may be any type of displayable content, including text, hypermedia, images, animations, 
audio, video, and the like. 

[0189] Accordingly, the Illumination Sorter 816 includes a Boolean Matching 1016 and Metric Matching 1018. The 
Boolean Matching 1016 evaluates the Boolean query 1034 used to select all illuminations that do meet the query 
20 constraints with respect to the facts in the database 804 or abstracted data from the attribute vector 808 via the Boolean 
Abstractor 812. 

[0190] Thus, all illuminations for which the Boolean query evaluates to TRUE are selected from the set of illumina- 
tions. The meaning of this result is that such illuminations do match facts or data descriptive of the transactions, inter- 
ests, preferences, or demographics of the consumer whose computer will potentially be selected for display. 
25 [0191] Each Boolean query may also be given a "priority, w which is used to sort a set of illuminations that only use 
Boolean queries, and do not use model queries. The priority is generally assigned by the content provider providing 
the illuminations. Those illuminations with higher priorities will appear earlier in the sorted illumination list. If no data 
from the attribute vector is used in the matching process, the priority assigned to the query is the match score produced 
by the matching subsystem. 

30 [0192] While Boolean querying alone may be used to select and sort ilium inations, it is preferable to employ model 
data of the consumer model for more fine-grained targeting of illuminations. This is achieved through associating a 
target vector 1038 with each illumination. This target vector 1038 is then compared by the Metric Matching 1018 to 
generate a match score, as a function of the distance (in multidimensional attribute space) between the attribute vector 
808 of the consumer and the target vector 1038 of an illumination. In one embodiment, the match score is the metric 

35 distance between the two vectors; in other embodiments it may be a combination of the measure of the angle between 
the vectors (as measured from the origin of the multidimensional space) and the measure of the length of the two 
vectors. 

[0193] The attribute vector 808 and target vectors 1038 used by the Metric Matching 1 01 8 may be the full vector of 
attributes (e.g. vector 902 in Fig. 9) or any abstraction of the full vector (e.g., vectors 904 or 906). However, a content 

40 provider may be interested in comparing only a selected number of the components of the target vector 1 038 with the 
attribute vector 808 of a consumer. Accordingly, in one embodiment the illumination also contains a relevancy vector 
1040, to indicate which elements of the target vector 1038 are important. If an element of the relevancy vector 1040 
is set to zero, the attribute represented by the element is of no interest in the matching; if set to 1 , it is completely of 
interest. A relevancy vector value may vary between 0 and 1 . indicating that the attribute is of any arbitrary level of 

^5 interest. 

[0194] The final match score is computed by the Metric Matching 1018 as a combination (e.g. weighted sum or 
product) of the priority resulting from the Boolean query, and the distance metric from the target vector 1038 to the 
consumer's attribute vector 808. 

[01 95] With appropriate normalization, the distance metric chosen will have a match score of 1 if the match is perfect, 
so and will asymptotically reach zero as the match becomes poor. Thus, the model score and the Boolean query with 
priority can be reconciled and sorted in a manner that yields useful results. 

3. Updating a Consumer Model 

55 [0196] This section describes one embodiment for updating the component elements of the attribute vector 808 in 
a consumer model. Updating is managed by the mapping subsystem 810. 

[0197] Each element in the attribute vector 808 represents a consumer preference, interest, psychographic, demo- 
graphic aspect, or alternatively, the probability the consumer is interested in a specific topic, category, and the like. 
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These attributes are updated as a function of a measure of relevancy of a transaction to each attribute. 

[0198] A number of different measure of relevancy may be used to update the attribute vector. In one embodiment, 

the measure of relevancy of a transaction is the conditional probability of the transaction occurring given the consumer's 

preference, as expressed in a selected attribute. Updating here may be done using Bayesian analysis. 

[0199] Let P(xld) denote the probability of x given data d (for example, that a specific event occurred). Let P(d Ix) 

denote the probability of the data (the event occurring), knowing the prior probability x. Let ~x denote the negative of 

x. And let P(d) be the a priori probability of data. 

[0200] Bayes' Theorem states that: 

P(x Id) = P(d| x) P(x) / P(d) 

[0201] But in the case where P(d)= P(dx) P(x) + P(d I ~-x)(1 - P(x)), i.e. , the probability of the data is the conditional 
probability under the case for x or ~-x, we get: 



P(x d) = P(d |x) P(x) / [P(d x) P(x) + P(d| ~x)(1 - P(x))] 

[0202] An example using real-world extensions to the variables will make it easy to understand. The probability 
20 values are shown in table 1102 of Fig. 11. Consider a case where x indicates an interest in cooking, and the data of 
interest is a transaction that occurs at Williams-Sonoma, a cooking-oriented store. Then: 

• P(x) = "Degree of (or probability of) interest in cooking" 

25 • A priori value = .001 

• Shown as the top-most value in column 1110 in table 1102 of Fig. 11 

• —x = not x = "Non-interest in cooking" 

• P(~x) = 1 - P(x) = "Degree of (or probability of) non-interest in cooking" 

30 

• Computed value - 0.999 

• d = "A transaction at Williams-Sonoma occurs" 

35 • This event triggers the application of Bayesian analysis by the mapping subsystem 810 

• P(d I x) = "Probability of a transaction at Williams-Sonoma occurring given an interest in cooking" 

• Assigned a value of 0. 1 based on statistical regression of existing data 

• Shown in column 1104 in table 1102 of Fig. 11 

40 • P(d hx) = "Probability of a transaction at Williams-Sonoma occurring given there is a non-interest in cooking" 

• Assigned a value of 0. 02 based on statistical regression of existing data 

• Shown in column 1106 in table 1102 of Fig. 11 

45 • p(x I d) = "Degree of (or probability of) being interested in cooking given a transaction at Williams-Sonoma occurs" 

• Value is to be calculated and used as new value of P(x) 

• Values are updated in successive rows of column 1110 in table 1102 of Fig. 11 

so [0203] The following code fragments (written in the Java programming language) illustrate one embodiment for Baye- 
sian updating by the mapping subsystem 810. Each attribute in an attribute vector is an instance of class Attribute. 
This class is defined below, as is the update function for the Attribute: 
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15 



20 



public class Attribute { 
// an attribute has a value. It 

// also has a name by which it can be referenced. 
private double valued- 
private String name; 



public boolean update (Transaction txn) { 

// update the value of the attribute according to 
// the Bayesian method. Return true if the value was updated, 
// or false if the value is not updated. 

// First fetch P(d I x) and P(d \ ~x) from the database (804) 
// using the fetch method of the Transaction object. This will 
// go out over the net to get the metadata from a metadata server 
// if it is not cached 
double pdx, pdnx; 
try ( 

25 pdx =* txn. fetch ("pdx", self); 

pdnx = txn. fetch ("pdnx** , self); 

// If there is no probability data, the transaction is 
// not relevant to this attribute and no updating is done 
30 ) catch (NoProbabilityOataException e) {return false; ) 

// otherwise apply Bayes* updating 

value » (pdx * value) / (pdx * value + pdnx * (1 - value)); 
return true; 



) 



40 [0204] The AttributeVector class contains an array of Attributes. This code fragment illustrates one example of how 
the attribute vector is updated for all transactions in a report: 



45 
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public class At tribut eVector ( 
5 // an array of attributes 

private Attribute at tributeArray []; 

public void updateAt tVector (Report report) { 

// the function txnList returns all transactions on the report 
List txnList - report . txnList () ; 
List txns; 
Attribute att; 

15 

// here we loop over each element in the attribute vector 
for (int i=0; KattributeArray . length; i++) ( 
att » attributeArray (ij ; 
20 // now,, for each txn in the list, we attempt to update, 

while (txns) { 

Transaction txn = (Transaction) head(txns); 
txns = tail ( txns) ; 
25 att .update (txn) ; 

I 

) 

} 

30 ) 



[0205] Figs. 1 1 a and 1 1 b shows the result of an example Bayesian update by the mapping subsystem 81 0, with the 
conditional probabilities being used to build up confidence in an attribute. In Fig. 11a, table 1102, there is shown an 
35 initial value for P(x) to be .001 (as indicated in the row specified by transaction number 0 in column 1108), indicating 
no significant interest in cooking. After four transactions, the value of P(x) in column 1110 rises to 0.385, indicating 
moderate interest. After eight transactions, P(x) has gone to 0.997, indicating a very strong preference for cooking- 
related activities. 

[0206] The ratio of P(d I x) to P(d I ~x), as shown in columns 1104 and 1106, determines the speed with which the 
40 certainty of interest increases with each transaction that occurs. 

[0207] Fig. 11b illustrates a second example in the table 1112 involves car rental, where the ratio of P(d I x) to P(d I 
~-x), in columns 1114 and 1116, is larger, and the convergence on virtual certainty of interest - P(x) in column 1120 - 
is faster. 

[0208] The speed with which the certainty of interest changes may be characterized as the velocity of interest. High 
45 velocities of interest for any particular preference, attribute, and the like may he used to interchanges in a consumer's 
(merest, and to further infer changes in facts or circumstances about the consumer, such as getting married, having a 
child, purchasing a car. a home, and the like. 

[0209] While Bayesian updating allows new information to enter the system, there is no sense of time in the equation. 
Time is important because transactions that occurred long in the past should have less relevance in determining today's 
so preferences. Thus, the present invention introduces a notion of "decay" in the life of the transaction data, which reduces 
the influence that older transaction have on the attribute vector 808. 

[0210] Decay of the influence of older transactions on the attribute vector 808 may be achieved by updating the 
attribute vector 808 using the conditional probabilities of no transaction occurring during a given period of time. This 
has the effect of introducing a "non transaction" that occurs when other transactions do not, and updating the attribute 
55 vector 808 accordingly. 

[0211] Figs. 12a and 12b illustrates this technique. All probability values are the same as in Figs. 11a and 11b, 
respectively, except for the introduction of two new quantities: 
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• P(n I x) = "Probability of no cooking-related transactions occurring given an interest in cooking" 

• Assigned a value of 0. 9 based on statistical regression of existing data 

• Shown in column 1204 in table 1202 of Fig. 12a 

• P(n I -x) = Probability of no cooking- related transactions occurring given there is a non-interest in cooking" 

• Assigned a value of 0. 98 based on statistical regression of existing data 

• Shown in column 1206 in table 1202 of Fig. 12a 

[0212] Looking at the table 1202 of Fig. 12a, there is shown a Williams-Sonoma transaction in period 1 and another 
in period 2, where the periods are shown in column 1 21 2. These two transactions yield the same values of P(x)- shown 
in column 1 21 4 -- as in the non-decay example of Fig. 1 1 a, namely 0.005 for period 1 and 0.024 for period 2. However 
in period 3, there is no Williams-Sonoma transaction, and no other cooking-related transaction. Hence the non-trans- 
action - involving P(n Ix) and P(n l~x), shown in columns 1204 and 1206 respectively - is employed, and the attribute 
vector 808 is updated during this period with the appropriate probabilities for the non-transaction. This non-transaction 
has the effect of reducing the probability estimation for interest in cooking. 

[0213] Interest again rises with another Williams-Sonoma transaction in period 4, and again in period 5. But periods 
6 and 7 lead to a decline in the probability of interest in cooking, and period 8 (with a Williams-Sonoma transaction) 
again leads to an increase. 

[0214] The code fragment relevant to updating the entire Attribute Vector based on the use of Bayesian updates, 
including the use of the "non transaction" for implementing decay is as follows. Note that the method updateAttVector 
has been altered to accept as arguments a Period as well as the Report: 
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public class AttributeVector { 
// an array of attributes 
private Attribute attributeArray [J; 

public void updateAttVector (Report report, Period period) { 
// the method txnList returns all transactions on the report 
// that occur within the specified period 
List txnList = report . txnList (period) ; 
List txns; 
Attribute att; 
15 boolean updateApplied; 

// here we loop over each element in the attribute vector 
for (int i=0; i < attributeArray . length; i++) { 
att = attributeArray(i) ; 

// now, for each txn in the list, we attempt to update. 
// If we do update, we note 

// that a transaction update has been applied 
updateApplied = false; 
while (txns) { 

Transaction txn = (Transaction) head (txns); 
txns = tail (txns) ; 

// update and change updateApplied to be true if there 
// really was an update, 

updateApplied = att . update (txn) II updateApplied; 
\ 

// if no updates were done, then we apply the 
// non-transaction to decay the probability value 
if (! updateApplied) { 
att . nonTransaction ( ) ; 
I 
) 

) 

45 



[0215] The class Attribute also has a new method to apply the non-transaction. This is shown in the revised code 
so fragment below: 
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public class Attribute { 
// an attribute has a value. It 

// also has a name by which it can be referenced. 
private double value; 
private String name; 



public boolean update {Transaction txn) { 

// update the value of the attribute according to 

// the Bayesian method. Return true if the value was updated, 

// or false if the value is not updated. 

// First fetch P(d I x) and P(d \ -x) from the database (804) 
// using the fetch method of the Transaction object. This will 
// go out over the net to get the metadata from a metadata server 
// if it is not cached 
double pdx, pdnx; 
try ( 

pdx - txn. fetch ("pdx", self); 
pdnx = txn. fetch ("pdnx", self); 

// If there is no probability data, the transaction is 
// not relevant to this attribute and no updating is done 
\ catch (NoProbabilityDataException e) {return false; ) 
// otherwise apply Bayes* updating 

value « (pdx * value) / (pdx * value + pdnx * (1 - value)); 
return true; 

\ 

public void nonTransaction {) { 

// update the value of the attribute according to 
// the Bayesian method, using the non-transaction metadata . 
// This will go out over the net to get the metadata from a 
// metadata server if it is not cached 

// fetch index "pntx" is the probability of a non- transact ion 
// given x; 

// "pntnx" is the probability of a non-transaction given not x 
double pntx, pntnx; 
try ( 
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pntx - act . fetch ( "pntx", self); 
pntnx - att . fetch { "pntnx", self); 
5 / / If there is no probabil ity data, the t transaction is 

// not relevant to this actribute and no updating is done 
} catch (NoProbabi 1 i t yDataExcept ion e) {return;) 
// otherwise apply Bayes' updating 
10 value = (pntx * value) / (pntx * value + pntnx * (1 - value)); 

} 

) 

15 [0216] The table 1220 in Fig. 12b shows that again, the ratio of the value of P(n I x) to P(n I ~x) - columns 1222 and 
1 224 - controls the rate at which confidence decays. These values are determined from statistical regression on existing 
data sets. 

[0217] The values of P(d I x) and P(d l~x) are metadata associated with each combination of attribute of the attribute 
vector 808 and transaction owner, and the values of P(n I x) and P(n I ~x) are metadata associated with each element 
20 of the attribute vector 808, and potentially with specific groups of transaction owners. A transaction owner is an entity 
with which the transaction is associated in the database 804, such as a merchant for a financial transaction, or a web 
site for a URL history list. The a priori probability values may be determined by statistical analysis of large amounts of 
blinded data, but are then used to refine specific consumer models. 

[021 8] In these examples, the value of P(d I x) and P(d I ~x) are not shown as having any dependency on the dollar 
25 value of the transaction. This need not be the case. The probabilities can be arbitrary functions of the transaction, 
including the owner of the transaction, the amount of the transaction, and other transaction data. 
[021 9] When the aggregated attributes from the attribute vector 808 are involved, the aggregated value is a weighted 
and normalized sum of a number of attribute values. Figs. 13a and 13b depicts tables showing conditional probability 
metadata for the following transactions: 

30 

• P(s I x) = "Probability of a transaction at Sportsmart occurring given an interest in sports" 

• Assigned a value of 0. 1 based on statistical regression of existing data 

• Shown as column 1306 in table 1302 of Fig. 13a 

35 

• P(s I —x) = "Probability of a transaction at Sportsmart occurring given there is a non-interest in sports" 

• Assigned a value of 0. 02 based on statistical regression of existing data 

• Shown as column 1308 in table 1302 of Fig. 13a 

40 

• P(v j x) = "Probability of a transaction at VolleyBallWorld occurring given an interest in volleyball" 

• Assigned a value of 0.25 based on statistical regression of existing data 

• Shown as column 1310 in table 1302 of Fig. 13a 

45 

• P(n l~x) = "Probability of a transaction at VolleyBallWorld occurring given there is a non-interest in volleyball" 

• Assigned a value of 0.01 based on statistical regression of existing data 

• Shown as column 1312 in table 1302 of Fig. 13a 

50 

• P(n I x) = "Probability of no sports-related transactions occurring given an interest in sports" 

• Assigned a value of 0. 9 based on statistical regression of existing data 

• Shown as column 1314 in table 1302 of Fig. 13a 

55 

• P(n I -x) = "Probability of no sports-related transactions occurring given there is a non-interest in sports" 

• Assigned a value of 0. 98 based on statistical regression of existing data 

• Shown as column 1316 in table 1302 of Fig. 13a 
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[0220] Table 1302 shows the result of three periods of transaction activity. The initial probabilities of interests in the 
various sports (shown in column 1304) are all set to 0.10 in column 1318. The next column, 1320 : shows the values 
of all the probabilities after a transaction at Sportsman, which can not be attributed to any specific-sport, and so has 
a moderate effect on all variables x t through x 3 . The last value in column 1 320 is the value for x, the indicator of general 
5 interest in sports; it is simply the average of the values of x 1 through x 5 . 

[0221] Column 1322 shows how all those probabilities decrease in period 2 : where no sports-related transaction 
occurs. Then, in period 3. a transaction occurs at VolleyBallWorld. This is shown in column 1324. Note that non- 
volleyball sports have had their probabilities reduced, while the volleyball preference increased. The value of x also 
increased significantly. 

w [0222] In Fig. 13b, table 1332 shows the result of three periods of transaction activity involving the same set of 
transactions in a different order. Again, the initial probabilities of interests in the various sports (shown in column 1 334) 
are all set to 0.10 in column 1348. The next column. 1350. shows the values of all the probabilities after a transaction 
at VolleyBallMart, which causes the value of x 4 to increase to 0.202 while all other values in column 1350 decrease 
slightly to 0.009. Again, the last value in column 1350 is the value for x, the indicator of general interest in sports; it is 

'5 the average of the values of x., through x 5 . 

[0223] Column 1352 shows how all probabilities decrease in period 2. where no sports-related transaction occurs. 
Then, in period 3. a transaction occurs at Sportsmart. This is shown in column 1 354. Note that all sports have had their 
probabilities increased, and value of x also has increased significantly. 

[0224] The weighted sum x tracks the general interest in sports, and can be used as an aggregate attribute in the 
20 attribute vector for content providers who wish to target information to consumers interested in sports in general, without 
regard to specific sports. Items such as sports nutrition products and general conditioning information would fall into 
such a category. 

[0225] Content providers who wished to target a specific sport could match their illuminations against the more spe- 
cific sport subcategories. This would be of interest to a merchant advertising a tennis magazine, for example. 

25 [0226] While a preferred embodiment uses Bayesian updating methods, updating of attribute vectors may be done 
with any variety of techniques, including exponential decay, wavelets. Gaussian combination, and the like. 
[0227] Once the concept of time dependence is incorporated into the attribute vector 808 as described above : it is 
apparent that the Boolean Abstractor 81 2 is a very powerful tool for directing targeted content to appropriate consumers. 
Queries on a conventional database (for example, using the SQL language) cannot easily capture a consumer's interest 

30 in a particular area, unless there is a very explicit mapping between specific interests and specific transactions. But in 
practice, it is impossible to provide sufficient mappings between the transactions of a consumer and ail of the possible 
interests and attributes that such transactions may evidence. 

[0228] In the present invention however, a particular value of an attribute in the attribute vector 808 may be derived 
from any number of transactions of very different types that occur over time or behaviors of the consumer. The present 

35 value of the specific attribute depends not only on these interrelated transactions, but also on their time dependence. 
[0229] Interest in golf, for example, may be indicated, by transactions at a golfing-oriented sports store, a vacation 
purchased from a golf-related vacations company, subscriptions to golf magazines, visiting golf-related Web sites, as 
well as by purchases at more general sporting goods stores and travel to cities which have famous golf courses. It 
would be nearly impossible for a content provider to craft a query gotf-interest? that could examine all transactions in 

40 a conventional database and determine the complex combination of supporting or detracting evidence that has accu- 
mulated over time from a series of purchases. Yet in the present invention, a single "golf interest" attribute in the attribute 
vector 808 may be defined, and updated from an arbitrarily complex sequence of transactions and consumer actions, 
thereby capturing in a single attribute the level of the consumer's interest in golf. Accordingly, it then becomes a simple 
matter to construct a Boolean query that determines the value of the golf-interest attribute to see if its value exceeds 

45 a threshold. 

[0230] Combinations of thresholds, such as interest in San Francisco and interest in football, provide even more 
useful targeting via the Boolean Abstractor 812. Such custom queries can be written simply and easily by a content 
provider using the Boolean Abstractor; they would be nearly impossible to write in SQL since the content provider 
would not have any way of determining all of the possible transactions and consumer behaviors that might evidence 
50 an interest in a specific area. 

0. Example Applications of Hierarchical Attribute Vectors 

1 . Content Rotator 

55 

[0231 ] Fig. 1 4 depicts one embodiment of a method for displaying targeted content according to the results of sorting 
a group of illumination candidates. The figure depicts a portion of credit card statement 1402 from MultiBank that is 
delivered electronically to a consumer's computer via a network, such as the Internet, and viewed on the consumer's 
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computer. 

[0232] The credit card statement contains a number of statement lines, each including a date : a merchant identifier, 
advertising space, and an amount. The advertising space is controllable by the merchant to provide an ordered se- 
quence of illuminations, as determined by the Illumination Sorter 816. 

5 [0233] For example, in statement line 1404, the merchant 1406 is bamesandnoble.com. The merchant has some 
advertising space available, which contains content 1 408 (here an advertisements) for "Top 20 list of best selling books 
for your child". This content is from an ordered set of illuminations, which have been included in. or are accessible 
from, the data that is used to construct the credit card statement. This particular content was deemed to be the most 
appropriate for this consumer, as a result of the illumination sorting process. However, there arc other applicable content 

10 of this merchant that were also selected by the illumination process, that is having target vectors or Boolean queries 
that matched attributes or facts of the consumer. However, given the limited space on the credit card statement that 
is allocated to each merchant, there is insufficient space for concurrently displaying all of the content. This problem of 
how to display multiple illuminations is solved by using the content rotator 1410, as indicated by the "star" graphic. 
Other graphical indicia may also be used, such as icon buttons, hyperlinks, changing cursor icons, and the like. 

15 [0234] When the consumer clicks on the content rotator 1410, a new content item may appear: this is shown in Fig. 
15, where content 1508 has now appeared. Additional clicks on the content rotator 1410 will cycle through the selected 
illuminations for this statement line 1 41 4. Various presentation orders may be used to determine the sequence in which 
the illuminations appear. In one embodiment, the content items are presented in random order. Another embodiment 
recognizes the value of screen real estate and thereby presents the content items in the order most likely to interest 

20 the consumer; i.e., in the order as specified by the Illumination Sorter 81 6. Other presentation orders may also be used. 
[0235] The content rotator 1410 is also useful in implementing the illumination policies of the statement provider, 
while still allowing the consumer to receive offers of value via the content items. For example, in Fig. 14, line 1412 of 
the consumer's statement has a blank area 1414 where a content item from the illumination process could have been 
placed. This content item may not have been shown because MultiBank, as owner of the credit card statement, may 

25 have wanted to keep a large fraction of the statement lines unilluminated. Accordingly, MultiBank defined a policy to 
effect this requirement, for example by allowing only less than a certain percentage of statement lines to be illuminated, 
or allowing a selected list of merchants to illuminate their statement lines. But the content rotator 1416 on this line 
informs the consumer that there is at least one content item available if he clicks on the content rotator 1416. 
[0236] When the consumer clicks on the content rotator 1416, the screen changes to look like Fig. 16. In this figure, 

30 the statement line 1412 now has an content 1614 where the blank space used to be. Again, continued clicking on the 
content rotator 1416 will cycle through the selected illuminations for this particular statement line 1412. 
[0237] In each of the foregoing, repeated clicking on the content rotator retrieves the particular selected illuminations 
that are associated with the merchant for that statement line. Thus a single credit card statement, with, for example, 
ten transaction lines, each having a set of five iliuminalions would present 50 illuminations to the consumer, available 

35 on a merchant by merchant basis simply by clicking on the content rotator associated with each statement line. 

[0238] While the foregoing examples of the content rotator are illustrated using a credit card statement with a set of 
merchants, each controlling a portion of the statement, the content rotator may be used in the context of any document 
transmitted to the consumer's computer, with one or more portions of the document associated with a number of po- 
tential illuminations that are selected by the Illuminations Sorter 816. Each such portion may be owned by a different 

40 entity, which specifies the set of potential illuminations for the portion, and hence the underlying content, Boolean query, 
target and relevancy vectors. In this manner, the document, when presented to the consumer, will be customized with 
the selected illuminations from a number of different content providers. Thus, the present invention extends customi- 
zation of information from a single document to an entire hypermedia collection, and thereby extends from the purely 
spatial dimensions of the display device to an additional dimension defined by the relevancy of content to the consumer's 

45 profile. The content rotator provides the navigational tool by which the consumer traverses this customized hypermedia 
collection. 

2. Hierarchical Discrimination of Content 

so [0239] A content provider may have a very rich set of illuminations that may be targeted to very specific types of 
consumers on the basis of their attributes vectors. However, the richness of this set makes it difficult for the content 
provider to select which illuminations to provide in a document being transmitted to a consumer for matching against 
that consumer's profile. 

[0240] For example, Fig. 14 the merchant in line 1404 is a bookseller vending a large variety of different types of 
55 books. Further, assume that the attribute vector 808 includes a "children" aggregate attribute that is an aggregate of 
a number of more specific attributes, such as "infant/preschool," "young school age," "middle schoolers," and "high 
schoolers." 

[0241] However, because the merchant on line 1404 can only send a few candidate illuminations (say, no more than 
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8) due to bandwidth or other limitations, it cannot transmit illuminations that target all the specific categories children's 
books plus all the categories of cookbooks, travel books, and so forth. 

[0242] Therefore, the merchant chooses to make use of the hierarchical structure of the attribute vector to effect a 
hierarchical discrimination of its content. The first illumination candidates transmitted to the consumer's computer cover 
s the broad areas of cooking, travel, sports, automotive, children, crafts, computers, and school. These illuminations will 
be processed by the Illumination Sorter 816 to determine which categories are most relevant to the consumer. The 
most relevant illumination will be displayed initially in the illuminated document, with the selected ones of the remaining 
illumination candidates available to the consumer via the content rotator, and ordered by their relevancy (e.g. match 
scores). 

io [0243] From this list, assume that the illumination for the "children" attribute is selected as the best match with the 
consumer. Indeed it may well be that the consumer has a new infant, but the merchant does not know this fact at the 
time the illumination is selected or presented to the consumer, because, as noted above, the illuminations are selected 
without their provider having any explicit access to the facts or model of the consumer. 

[0244] When the consumer clicks on the content item 1408 ("Top 20 List of best selling books for your child"), the 
15 merchant's server receives an explicit request for children's books. This request is then used to select a more specific 
set of potential illuminations to send to the consumer's computer. In effect, the merchant is now aware of the consumer's 
interest in children's books (as this information has been volunteered by the consumer when he clicked on the content 
item 1408), but still does not know which sub-category of children is appropriate. Thus, the second set of candidate 
illuminations is directed to specific subcategories within the children category (infant/preschool, young school age, 
20 middle schoolers, and high schoolers), and includes target vectors 1 036 containing attributes corresponding to these 
specific sub-categories. The relevancy vectors 1038 is also set to restrict interest to only these attributes. 
[0245] Using this second set of illuminations, the Illumination Sorter 81 6 now matches these target vectors 1 036 to 
the consumer's attribute vector 808. The infant/preschool category is found to have the highest match with the con- 
sumer's attribute vector, and the content shown in Fig. 17 is displayed, here as a separate window, instead of in the 
25 statement line. 

[0246] A second consumer, in a different household may see the same content 1 408 on his statement, and may also 
click on it. But in that consumer's lower level attribute vector 808, the "young school age" sub-category may be the 
best match, so the content which is displayed from all the conditional illuminations is different, as shown in Fig. 18. 
[0247] This process of hierarchical discrimination may be repeated any number of times, thus providing for a very 

30 high degree of targeting without requiring any a priori knowledge about the particular consumer (e.g., without having 
to rent lists of consumer profiles tied to IP addresses, or storing a cookie in the consumer's browser). In addition, the 
consumer's privacy is assured (in particular the details of the database 804 and attribute vector 808), as the only 
information going back to the merchant (or other content provider) is the information contained in the click on the 
content, as performed consciously and willingly by the consumer. 

35 [0248] This discussion has assumed that the content provider has a static set of content items that are to be chosen 
from. However, that need not be the case. Parameterized content could make use of data in the consumer model to 
fill in portions of the content itself. In such as case, content such as content 1408 in Fig. 14 could be tailored to say 
"Top 20 list of best selling books for your infant" or "Top 20 list of best selling books for your young adult where the 
italicized components are selected from the best matching attribute in the attribute vector 808. 

40 [0249] Now considering Fig. 19, there is shown a credit card statement 1 902 after illumination in accordance with 
the present invention. The statement includes a number of transaction lines, such as transaction line 1912, each of 
which includes information fields identifying the date of the transaction, the merchant 1914, and the amount 1908. 
[0250] First, many of the simple text-only merchant names found in conventional statements have been replaced by 
graphic icons 1904 illustrating the merchant's particular tradename or mark. This reinforces brand identity of the mer- 

45 chant right at the point the consumer is reviewing a particular transaction with the merchant. 

[0251] In one aspect, illumination provides for the replacement of this static text, such as the merchant's name, with 
a hyperlink 1 906 to the merchant's web site (or any other web site, as determined by the selectable content alternative 
data). Thus, here illumination transforms the otherwise static credit card statement into a dynamic, structured and 
active document that allows the consumerto immediately access more information about a merchant during the process 

50 of reviewing the credit card statement online. Note that a graphic icon 1904 may also be a hyperlink 1906. 

[0252] The "Amount" column 1907 in the transaction listing in the illuminated statement 1 902 contains hyperlinked 
amounts 1 908. Clicking on a hyperlinked amount 1 908 allows the consumer to view further detail about the transaction, 
as shown in Fig. 20 (to be described shortly). 

[0253] From a marketing perspective, the present invention further provides for the insertion of targeted promotional 
55 content 1910 into specific transaction lines 1912. This targeted content is customized for each individual consumer 
based upon the consumer's own preferences, in accordance with the principles of the illumination process described 
above. 

[0254] The overall layout and placement of the promotional content 1 91 0 relative to the information fields, icons, and 
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hyperlinks has a significant effect on the consumer's perception of the statement 1 902. In accordance with one aspect 
of the present invention, the certain information fields of the transaction, such as the merchant name 1914, are displayed 
in a first location that is viewed prior to other information fields placed in a second location. The promotional content 
1910 is then placed in a location between the first and second locations. 

5 [0255] Normally, during the task of reviewing a non-illuminated statement, the consumer will quickly scan the list of 
transactions, looking for anomalous transactions. This is a very quick process. However when the statement 1902 is 
illuminated and arranged as in Fig. 19, this scanning process is slowed down significantly. First, the consumer takes 
much more time to process an information field at the first location, such as a graphic icon 1 914 depicting a merchant 
rather than a text string (in other words, an image is harder to skip over quickly). 

10 [0256] Second, as is shown in transaction line 1912, when the consumer's eye moves from the merchant name 1 91 4 
to a second information field at a second location, such as the amount 1908, the eye must encounter the promotional 
content 1 91 0 which is displayed between the first and second locations, as it moves across the statement. This process 
of moving the eye from the merchant name 1 91 4 to the amount 1 908 is necessary for the consumer to form a complete 
identification and understanding of the transaction. That is, merely knowing the merchant's name or the amount is 

15 typically insufficient generally to completely identify the transaction. In this cognitive operation of identifying and vali- 
dating the transaction, the specific, targeted promotional content 1910 has been placed between these two elements 
of the transaction, and is thereby visually and cognitively linked with the transaction. In particular, the promotional 
content 1910 is linked with the merchant 1914 since it follows the merchant name's 1914 when viewing from left to 
right. This position facilitates the branding of the targeted content 191 0 with the merchant, exactly at the point where 

20 the transaction is identified by the consumer. 

[0257] In particular, the association between the promotional content 1 910 and the merchant 1 914 may take various 
forms. If the promotional content 1 91 0 is for a discount or other incentive for the merchant of the transaction, then this 
further incentivizes the consumer to patronize the merchant with additional transactions, further establishing the value 
of the merchant in the consumer's mind. If the promotional content 1910 is for a separate merchant, then this creates 

25 a co-branding relationship between the merchant of the transaction and the merchant of the promotion. 

[0258] Fig. 20 shows an example of a pop-up window 2002 that is displayed when the consumer clicks on a hyper- 
linked amount 1908 in Fig. 19, here, the fourth transaction in statement 1902. This window 2002 includes details on 
the store location 2004. and transaction details 2006 such as the date of the transaction, amount, and the reference 
number. Also included in the window 2002 are hyperlinked icon 2008 for emailing the merchant or the bank in the event 

30 of a problem, and hyperlinked icon 2010 for viewing store receipt, if the merchant has it available online. This link 2010 
resolves to a database of receipts held by the merchant (or the statement provider, as appropriate), for example using 
the reference number to obtain the receipt information. 

[0259] Keeping this level of detail in a separate pop-up window 2002, rather than displaying it in the credit card 
transaction statement 1 902, means that the statement 1 902 is visually much simpler and less cluttered. This makes it 
35 easier for the consumer to scan the statement 1 902, and it also provides more opportunity for the consumer to focus 
on other aspects of the statement such as the promotional content 1 91 0. 

[0260] The credit card statement 1 902 as illustrated in Fig. 19 is merely one form of structured financial document 
in which the arrangement principles of the present invention may be implemented. The present invention may be 
applied to other types of financial documents, such as bank statements, brokerage statements, receipts, and the like. 

40 For example, in a bank statement, which typically includes a transaction description such as "check" or "deposit" or 
"cash withdrawal" and an amount, the promotional content 1 91 0 may again be placed in between the locations of these 
items. In a brokerage statement, which typically includes for a transaction a name of a security and an amount of the 
transaction in a single line, the content 1910 may again be placed between these two items, forming an association 
between the security (e.g. identified by a company name) and the content 1910. Likewise, while transaction amount 

45 js a useful information field for purposes of this aspect of the invention, other information fields that are cognitively 
related to the transaction, or to the consumer's focus of interest in reviewing the transaction, may be used. This feature 
is further discussed below. 

[0261] Another way to induce consumers to view promotional content more readily is to provide the transaction data 
in other forms that may be more meaningful or useful. Fig.21 shows one such means of providing data in alternative 

so ways, in the form of a modified credit card statement 21 01 . This statement is similar to statement 1 902, with the addition 
of a "dashboard" 2104 below the transaction line items. This dashboard 2104 is a control device containing a number 
of buttons 2106 (or equivalent user controls) which each links to a different vertical portal. The vertical portals are 
organized by transaction categories, with a specific vertical portal associated with, and adapted to display transactions 
for a specific category; the specific categories can be different for different consumers, and may be determined from 

55 analysis of each consumer's transactions in their consumer database. Clicking on a button 21 06 results in the activation 
of the corresponding vertical portal. 

[0262] Figs. 22-25 illustrate various online financial statements to illustrate the arrangement principles of this aspect 
of the invention. 
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[0263] First, Fig. 22 depicts a vertical portal 2202 for the restaurant category. The intended use of this portal is for 
the consumer to obtain menu information for a particular restaurant, and thus the focus of attention will be ultimately 
be on the menu icon 2206. This particular vertical portal contains lines for each restaurant in the consumer database. 
Each line contains a transaction of the consumer in the category of the portal, with identifying information fields. Par- 
ticular transactions may be further illuminated in accordance with one embodiment present invention. Here, an illumi- 
nated transaction includes a name of a restaurant 2204, which may be hyperlinked or converted to a graphical icon, 
depending on the embodiment, a hyperlinked icon 2206 for viewing a menu for the specific restaurant 2204 (if available, 
e.g. from the restaurant's web server), and an optional item of promotional content 2208. Note however, that the pro- 
motional content 2208 is arranged after the information fields of the transaction. Thus, the user need not attend to the 
content 2208 in order to see and manipulate the icon 2206 of a restaurant's menu. 

[0264] Fig. 23 shows a vertical portal 2302 in another portal which employs the arrangement principles described 
herein, in which the position of the promotional content column and the menu icon column have been switched relative 
to Fig. 22, to position the content 2308 between the restaurant name 2304 and its menu icon 2306. In this case, in 
order to scan from the restaurant 2304 to the menu icon 2306 which is the intended focus of attention, the consumer's 
eye must pass over the promotional content 2308. If the consumer is coming to this portal to view menus, then the 
layout of Fig. 23 provides a higher likelihood that the promotional content will be examined by the consumer. This is 
because the visual scanning process by which the consumer first identifies the restaurant of interest and then looks 
for its menu — thereby completely the cognitive relationship between these two elements of the transaction — must 
visual pass over the targeted content 2308. 

[0265] Fig. 24 depicts a vertical portal 2404 for the air travel segment. Like the restaurant portal of Fig. 22, this 
consists of a number of lines of information gleaned from the consumer database; in this case, there is a line for each 
flight segment 2404 (or alternatively, complete flight) flown by the consumer. For each flight segment, the portal 2404 
shows the lowest fare 2406 available that day to fly the segment. Optional promotional content 2408 may also be 
shown, but as with Fig. 22, the promotional information is not arranged for advantageous use of the consumer's visual 
scanning process. Thus, again the consumer may focus on the fare information 2406 without attending to the promo- 
tional content 2408. 

[0266] In Fig. 25, again there is a vertical portal 2504 for the air travel segment, but here the columns for the pro- 
motional content 2408 and the lowest fare 2406 have been switched relative to Fig. 24. to provide the above described 
type of arrangement of transaction information and promotional content. Again, if the consumer is looking for the lowest 
fare for a particular segment — thereby making the cognitive association between the trip and the fare-then visually 
scanning from the segment description 2504 to the fare 2506 forces the eye to look at the promotional content 2508, 
thereby increasing the likelihood the consumer will examine the promotional content 2508, and forming a cognitive 
association between the promotional content, and the travel transaction. 



Claims 

1. A method of customizing a structured document delivered electronically to a consumer computer of a consumer, 
the consumer computer including a display device, the method comprising: 

receiving a structured document at the consumer computer; 

segmenting the structured document into a plurality of variable content sections, each variable content section 
having a plurality of selectable content alternatives; 

for each variable content section, selecting one of the content alternatives for augmenting the section by 
evaluating the content alternatives with respect to a consumer profile of the consumer, and augmenting the 
section with the selected content alternative; and 

presenting the augmented structured document to the consumer on the display device. 

2. The method of claim 1 , wherein the consumer profile is secured from access by anyone other than the consumer. 

3. The method of claim 1 , wherein the consumer profile is stored locally on a computer of the consumer. 

4. The method of claim 1 , wherein the consumer profile is stored on a server and includes a plurality of consumer 
profiles for a plurality of different consumers, each consumer profile is secured for access only by the individual 
consumer associated with the consumer profile. 

5. The method of claim 1 , wherein the consumer profile includes a consumer model that is a vector of characteristic 
values for a plurality of characteristics of a consumer. 
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6. The method of claim 1 , wherein the consumer profile includes a model of the consumer's interests, preferences, 
or behaviors derived from transactions of the consumer. 

7. The method of claim 1 , wherein the consumer profile includes a database including facts derived from transactions 
5 of the consumer. 

8. The method of claim 1 , further comprising: 

processing a structured document at the consumer computer to identify transactional behavior of the consum- 
10 er; and 

updating the consumer profile with respect to the identified transactional behavior. 

9. The method of claim 8, wherein updating the consumer profile further comprises: 

15 identifying at least one of a merchant or a product in the transactional behavior; and 

updating the consumer profile with respect to the identified a merchant or product. 

10. The method of claim 9, wherein updating the consumer profile further comprises: 

retrieving information about merchants, products, or consumer model parameters over a network from sourc- 
20 es remote to the consumer computer. 

11. The method of claim 10, wherein the information retrieved contains metadata related to structured documents or 
consumer profiles. 

25 12. The method of claim 8, wherein updating the consumer profile further comprises: 

updating the consumer profile as a function of a date or time of a transaction included in the transactional 
behavior. 

13. The method of claim 8, wherein updating the consumer profile further comprises: 

30 updating the consumer profile as a function of an amount of a transaction included in the transactional be- 

havior. 

14. The method of claim 1 , wherein the structured document is an online transaction report. 

35 15. The method of claim 14, wherein the transaction report is one of a group consisting of: an online credit card state- 
ment, an online bank statement, an online bill, an investment portfolio, a medical report, a receipt a file containing 
a list of Uniform Resource Locators, or a tax form. 

16. The method of claim 1 , wherein the transaction report is a financial statement delivered electronically to the con- 
40 sumer, the financial statement including a plurality of financial transactions, at least one financial transaction in- 
cluding a merchant identifier of a merchant of the transaction, the financial transaction forming one of the variable 
content sections. 

17. The method of claim 1 , wherein the content alternatives are encoded as queries to the consumer profile, each 
45 query including an identifier of a source from which the content alternative can be retrieved, and a set of attributes 

for evaluating with respect to the consumer profile. 

1 8. The method of claim 1 , wherein augmenting the a transaction f u rther comprises displaying a graphic representation 
of a coupon for a product in proximity to the merchant identifier of a selected one of the transactions. m 

50 

19. The method of claim 1 , wherein the consumer computer is a set top box. 

20. The method of claim 1 , wherein augmenting a variable content section comprises inserting a coupon into the 
section for display to the consumer. 

55 

21. The method of claim 1 , wherein augmenting a variable content section comprises replacing a text item with a 
graphic image. 
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22. The method of claim 1 , wherein augmenting a variable content section comprises establishing a hyperlink between 
an item of the variable content section and another item of data. 

23. The method of claim 1 , wherein augmenting a variable content section comprises replacing variable information 
with information specifically associated with the consumer. 

24. The method of claim 1 , wherein augmenting a variable content section with a selected content alternative comprises 
retrieving the selected content alternative as hypermedia data from a computer distant to the consumer computer 
and inserting the retrieved hypermedia data into the structured document. 

25. The method of claim 1 , wherein selecting one of the content alternatives for a variable content section comprises: 

comparing a set of attributes for each of the content alternatives with attributes of the consumer defined in the 
consumer profile; and 

selecting the one content alternative having attributes most closely matching attributes of the consumer. 

26. The method of claim 1 , further comprising: 

maintaining a database of content alternatives, each content alternative including a content specifier and a 
selection rule; and 

wherein selecting one of the content alternatives for a variable content section comprises: 

applying the selection rules for each of the content alternatives for the variable content section to the 
consumer profile to determine a degree to which each content alternative is appropriate for the consumer; 
and 

selecting a most appropriate content alternative. 

27. The method of claim 1 , wherein selecting one of the content alternatives for a variable content section comprises: 

comparing a set of attributes for each of the content alternatives with attributes of the consumer defined in the 
consumer profile; and 

selecting the one content alternative having attributes most closely matching attributes of the consumer. 

28. The method of claim 1 , further comprising: 

associating each of a plurality of portions of the structured document with at least one policy for augmenting 
variable content sections within the portion; and 

for each portion of the structured document, augmenting the variable content section in the portion in accord- 
ance with the at one least policy associated with the portion. 

29. The method of claim 1 1 wherein at least one portion of the structured document has an owner of the portion, and 
there is at least one policy associated with the owned portion which is defined by the owner to control the selection 
of the content alternatives for variable content sections within the portion. 

30. The method of claim 1 , wherein the structured document has an owner, and there is a policy associated with the 
structured document and defined by the owner to control the selection of the content alternatives for all variable 
content sections within the structured document. 

31 . A method of customizing an online financial statement delivered electronically to a consumer computer of the 
consumer, the consumer computer including a display device, the financial statement including a plurality of trans- 
actions, the method comprising: 

receiving the financial statement at the consumer computer; 

segmenting the financial statement into a plurality of variable content sections, each variable content section 
having a plurality of selectable content alternatives, wherein at least one transaction of the financial statement 
includes at least variable content section; 

for at least one of the variable content sections included in a transaction, selecting one of the content after- 
natives by evaluating the content alternatives with respect to a consumer profile of the consumer, and aug- 
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meriting the section with the selected content alternative; and 

presenting the augmented financial statement to the consumer on the display device. 

32. The method of claim 31 , wherein the consumer profile is secured from access by anyone other than the consumer. 

5 

33. The method of claim 31 , wherein the consumer profile is stored locally on a computer of the consumer. 

34. The method of claim 31 , wherein the consumer profile is stored on a server and includes a plurality of consumer 
profiles for a plurality of different consumers, each consumer profile is secured for access only by the individual 

10 consumer associated with the consumer profile. 

35. The method of claim 31 , wherein the consumer profile includes a consumer model that is a vector of characteristic 
values for a plurality of characteristics of a consumer. 

15 36. The method of claim 31 , wherein the consumer profile includes a model of the consumer's interests, preferences, 
or behaviors derived from transactions of the consumer. 

37. The method of claim 31 . wherein the consumer prof ile includes a database including facts derived from transactions 
of the consumer. 

20 

38. The method of claim 31 , wherein the financial statement is a credit card statement. 

39. The method of claim 31 , wherein the financial statement is a bank statement. 

25 40. The method of claim 31 , wherein the financial statement is delivered via a set top box. 

41 . The method of claim 31 , wherein augmenting a variable content section comprises inserting a coupon. 

42. The method of claim 31 , wherein augmenting a variable content section for a transaction comprises inserting 
30 content related to the merchant identified in the transaction. 



43. The method of claim 31 , wherein augmenting a variable content section for a transaction comprises inserting 
content related to a merchant other than the merchant identified in the transaction. 



35 44. The method of claim 31 , wherein augmenting a variable content section for a transaction comprises inserting a 
hyperlink to a web site associated with a merchant identified in the transaction. 

45. The method of claim 31 , further comprising: 

40 processing the financial statement at the consumer computer to identify each transaction of the consumer; and 

updating the consumer profile with respect to the identified transactions. 

46. The method of claim 31 , wherein updating the consumer profile further comprises: 

45 identifying at least one of a merchant or a product in a transaction; and 

updating the consumer profile with respect to the identified a merchant or product. 

47. The method of claim 31 , wherein updating the consumer profile further comprises: 

updating the consumer profile as a function of a date or time of a transaction in the financial statement. 

50 

48. A method of targeting information presented to a consumer at a viewing device, comprising: 



storing a consumer profile locally at the viewing device, the consumer profile derived from a plurality of trans- 
actions of the consumer, and describing attributes of the consumer; 
55 receiving at the viewing device a plurality of queries, each query associated with an identifier for retrieving an 

item of information and with a set of attributes descriptive of the item of information; 

evaluating each of the queries at the viewing device with respect to the consumer profile to select the item of 
information with the attributes that most closely matches attributes of the consumer; 
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retrieving the selected item of information using the identifier received in the query associated with the item 
of information; and 

presenting the retrieved item of information to the consumer at the viewing device. 

s 49. The method of claim 48, wherein receiving at the viewing device a plurality of queries comprises receiving a web 
page containing the queries as encoded URLs. 

50. The method of claim 48 , wherein receiving at the viewing device a plurality of queries comprises receiving the 
queries over a television signal. 

10 

51 . A system for customizing a structured document delivered electronically to a consumer computer of a consumer 
the system comprising: 

a consumer profile describing interests of a consumer derived from transactions of the consumer, and securely 
15 stored to prevent access to the consumer profile by anyone other than the consumer; 

an illumination module that segments structured documents received at the consumer computer into a plurality 
of variable content sections, each variable content section having a plurality of selectable content alternatives, 
and for each variable content section, selects one of the content alternatives by evaluating the content alter- 
natives with respect to a consumer profile, and augments the section with the selected content alternative; and 
20 a presentation module that presents the augmented structured document to the consumer. 

52. The system of claim 51 further comprising: 

an interpretation module that processes structured documents to derived information related to transactions 
of the consumer and updates the consumer profile with the derived information. 

25 

53. A method of customizing a structured document delivered electronically to a consumer computer of a consumer 
the consumer computer including a display device, the method comprising: 

receiving a structured document at the consumer computer, the structured document including a plurality of 
30 variable content sections, each variable content section having a plurality of selectable content alternatives; 

for each variable content section, selecting a subset of the content alternatives for augmenting the section by 
evaluating the content alternatives with respect to a consumer profile of the consumer, and ordering the subset 
of content alternatives into an order including a first content alternative, augmenting the section with the first 
content alternative: 

35 presenting the augmented structured document to the consumer on the display device with the first content 

alternative for each variable content section; and 

enabling retrieval of the remaining content alternatives for each variable content section. 



54. The method of claim 53, further comprising: 

40 ordering the selected content alternatives for a variable content section by a function of their relevancy to 

the consumer profile. 

55. The method of claim 53, further comprising: 

presenting, in response to an action by the consumer with respect to one of the variable content sections, 
45 at least one of the remaining content alternatives for the variable content section. 

56. The method of claim 53, further comprising: 

presenting, in response to an action by the consumer with respect to one of the variable content sections, 
at least one of the remaining content alternatives for the variable content section in place of the first content after- 
50 native for the variable content section . 

57. The method of claim 53, wherein evaluating the content alternatives with respect to a consumer profile of the 
consumer further comprises: 

comparing a target vector of attributes associated with a content alternative with a vector of attributes asso- 
55 dated with the consumer profile. 

58. The method of claim 57, further comprising: 
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selecting at least one attribute from the target vector using a relevancy vector comprising relevancy values 
corresponding to at least one attribute of the target vector; and 

comparing only those attributes of the target vector for which the relevancy values are exceed a threshold. 

5 59. The method of claim 58, wherein the relevancy value determines a relative importance of attributes of the target 
vector. 

60. The method of claim 53, wherein evaluating the content alternatives with respect to a consumer profile of the 
consumer further comprises: 

w evaluating a Boolean query with respect to facts derived from transactions of the consumer. 

61 . The method of claim 53, wherein evaluating the content alternatives with respect to a consumer profile of the 
consumer further comprises: 

evaluating a Boolean query with respect to logical abstractions derived from an attribute vector describing 
is attributes of the consumer. 

62. The method of claim 53, wherein the consumer profile includes a consumer attribute vector, comprising: 

a base vector comprising a plurality of base level attributes of the consumer, each base level attribute having 
20 a value; and 

at least one hierarchical vector comprising at least one aggregate attribute associated with a plurality of base 
level attributes of the base level vector, and having a value computed as a function of the values of the asso- 
ciated base level attributes. 

25 63. A method of selecting content for presentation to a consumer, comprising: 

storing a consumer model of the consumer including an attribute vector having a plurality of attributes of the 
consumer; 

receiving a document associated with a plurality of selectable content alternatives, each selectable content 
30 alternative associated with a target vector representing expected attributes of a consumer; 

comparing the attribute vector of the consumer with at least one of the target vectors to determine a distance 
measure between the target vectors and the attribute vector; 

selecting at least one of the content alternatives as a function of the distance measure; and 
presenting the selected at least content alternative to the consumer. 

35 

64. The method of claim 63, further comprising: 

receiving in association with each target vector a relevancy vector including a plurality of relevancy values, 
each relevancy value associated with at least one attribute of the target vector, the relevancy value indicating 
40 a measure of relevancy of the associated attribute of the target vector; 

scaling the attributes of the target vector by the relevancy value associated with each attribute prior to com- 
paring the target vector to the attribute vector. 

65. The method of claim 63, further comprising: 

45 

receiving in association with at least one of the plurality of selectable content alternatives a query with respect 
to at least one attribute of the attribute vector of the consumer; and 

evaluating the query against the attribute vector of the consumer to determine whether the content alternative 
may be selected for presentation to the consumer. 

50 

66. The method of claim 65, wherein the query includes a priority value, and selecting at least one of the content 
alternatives as a function of the distance measure further comprises: 

computing a match score for each content alternative as a function of the distance measure between the target 
55 vector associated with the content alternative and the attribute vector of the consumer, and the priority value 

of a query associated with the content alternative; and 
selecting the content alternative with a highest match score. 
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67. A method of selecting content for presentation to a consumer, the consumer associated with a consumer model 
describing attributes of the consumer, the content delivered electronically to a consumer computer of a consumer, 
the consumer computer including a display device, the consumer associated with a consumer profile including a 
plurality of attributes of the consumer including at least on aggregated attributed having a value computed from a 
5 plurality of other attributes, the method comprising: 

receiving the structured document at the consumer computer, the structured document including a variable 
content section having a first set of content alternatives; selecting a first content alternative for augmenting 
the section by evaluating attributes of the content alternatives as a function of corresponding attributes of the 
10 consumer profile of the consumer; 

presenting the structured document with the first content alternative on the display device: 
receiving a consumer action indicating selection of the first content alternative; 

sending in response to the consumer action a second set of variable content alternatives, including a content 
alternative having an attribute that contributes to an aggregate attribute of the first content alternative; 
15 selecting a second content alternative by evaluating the attributes of second set of content alternatives as a 

function of corresponding attributes of the consumer profile of the consumer; and presenting the structured 
document with the second content alternative on the display device. 
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